Class CmsExtractorMsOfficeOOXML

  • All Implemented Interfaces:
    I_CmsTextExtractor

    public final class CmsExtractorMsOfficeOOXML
    extends A_CmsTextExtractor
    Extracts text data from a VFS resource that is an OOXML MS Office document.

    Supported formats are MS Word (.docx), MS PowerPoint (.pptx) and MS Excel (.xlsx).

    The OLE 2 format was introduced in Microsoft Office version 97 and was the default format until Office version 2007 and the new XML-based OOXML format.

    Since:
    8.0.1