Class CmsExtractorMsOfficeOLE2

  • All Implemented Interfaces:
    I_CmsTextExtractor

    public final class CmsExtractorMsOfficeOLE2
    extends A_CmsTextExtractor
    Extracts text data from a VFS resource that is an OLE 2 MS Office document.

    Supported formats are MS Word (.doc), MS PowerPoint (.ppt) and MS Excel (.xls).

    The OLE 2 format was introduced with Microsoft Office 97 and remained the default format until Office 2007 replaced it with the new XML-based OOXML format.
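
    The distinction between the OLE 2 and OOXML containers can be checked from the first bytes of a file. The following is a minimal sketch, not part of this class or of the OpenCms API: the class name Ole2Sniffer and all of its methods are hypothetical. It assumes only the well-known file signatures: OLE 2 compound files begin with the bytes D0 CF 11 E0 A1 B1 1A E1, while OOXML documents are ZIP archives and begin with "PK" 03 04.

```java
import java.io.IOException;
import java.io.InputStream;
import java.util.Arrays;

// Hypothetical helper, for illustration only; not part of OpenCms.
final class Ole2Sniffer {

    // Signature found at the start of every OLE 2 compound file.
    private static final byte[] OLE2_MAGIC = {
        (byte) 0xD0, (byte) 0xCF, 0x11, (byte) 0xE0,
        (byte) 0xA1, (byte) 0xB1, 0x1A, (byte) 0xE1
    };

    // ZIP local file header signature; OOXML files are ZIP archives.
    private static final byte[] ZIP_MAGIC = { 'P', 'K', 0x03, 0x04 };

    private Ole2Sniffer() {
        // static helpers only
    }

    // Returns true if data begins with the given prefix.
    static boolean startsWith(byte[] data, byte[] prefix) {
        if (data == null || data.length < prefix.length) {
            return false;
        }
        return Arrays.equals(Arrays.copyOf(data, prefix.length), prefix);
    }

    // True if the leading bytes carry the OLE 2 signature (.doc, .ppt, .xls).
    static boolean isOle2(byte[] head) {
        return startsWith(head, OLE2_MAGIC);
    }

    // True if the leading bytes carry the ZIP signature (.docx, .pptx, .xlsx).
    static boolean isZipBased(byte[] head) {
        return startsWith(head, ZIP_MAGIC);
    }

    // Reads up to n bytes from the stream; returns fewer if the stream ends early.
    static byte[] readHead(InputStream in, int n) throws IOException {
        byte[] buf = new byte[n];
        int off = 0;
        while (off < n) {
            int r = in.read(buf, off, n - off);
            if (r < 0) {
                break;
            }
            off += r;
        }
        return Arrays.copyOf(buf, off);
    }
}
```

    A caller could read the first eight bytes with Ole2Sniffer.readHead(stream, 8) and pass the result to isOle2 before choosing an extractor. Note that the signature identifies only the container: other applications also write OLE 2 compound files, so a match does not guarantee a Word, PowerPoint or Excel document.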

    Since:
    8.0.1