Class CmsDocumentMsOfficeOOXML

java.lang.Object
org.opencms.search.documents.A_CmsVfsDocument
org.opencms.search.documents.CmsDocumentMsOfficeOOXML
All Implemented Interfaces:
I_CmsDocumentFactory, I_CmsSearchExtractor

Lucene document factory class to extract text data from a VFS resource that is an OOXML MS Office document.

Supported formats are MS Word (.docx), MS PowerPoint (.pptx) and MS Excel (.xlsx).

The OLE 2 format was introduced in Microsoft Office version 97 and was the default format until Office version 2007 and the new XML-based OOXML format.

Since:
8.0.1