Class CmsExtractorMsOfficeOOXML

java.lang.Object
org.opencms.search.extractors.A_CmsTextExtractor
org.opencms.search.extractors.CmsExtractorMsOfficeOOXML
All Implemented Interfaces:
I_CmsTextExtractor

Extracts text data from a VFS resource that is an OOXML MS Office document.

Supported formats are MS Word (.docx), MS PowerPoint (.pptx) and MS Excel (.xlsx).

The OLE 2 format was introduced in Microsoft Office version 97 and was the default format until Office version 2007 and the new XML-based OOXML format.

Since:
8.0.1