Class CmsExtractorMsOfficeOLE2

java.lang.Object
org.opencms.search.extractors.A_CmsTextExtractor
org.opencms.search.extractors.CmsExtractorMsOfficeOLE2
All Implemented Interfaces:
I_CmsTextExtractor

public final class CmsExtractorMsOfficeOLE2 extends A_CmsTextExtractor
Extracts text data from a VFS resource that is an OLE 2 MS Office document.

Supported formats are MS Word (.doc), MS PowerPoint (.ppt) and MS Excel (.xls).

The OLE 2 format was introduced in Microsoft Office version 97 and was the default format until Office version 2007 and the new XML-based OOXML format.

Since:
8.0.1