Package org.opencms.search.extractors
Class CmsExtractorRtf
java.lang.Object
org.opencms.search.extractors.A_CmsTextExtractor
org.opencms.search.extractors.CmsExtractorRtf
- All Implemented Interfaces:
I_CmsTextExtractor
Extracts the text from a RTF document.
- Since:
- 6.0.0
-
Method Summary
Modifier and TypeMethodDescriptionExtracts the text and meta information from the document on the input stream.static I_CmsTextExtractor
Returns an instance of this text extractor.Methods inherited from class org.opencms.search.extractors.A_CmsTextExtractor
combineContentItem, extractText, extractText, extractText, extractText, removeControlChars
-
Method Details
-
getExtractor
Returns an instance of this text extractor.- Returns:
- an instance of this text extractor
-
extractText
Description copied from interface:I_CmsTextExtractor
Extracts the text and meta information from the document on the input stream.The encoding of the input stream is either not required (the document type may have one common default encoding) or the extractor is able to divine the encoding from the provided input stream automatically.
Delivers is the same result as calling
whenI_CmsTextExtractor.extractText(InputStream, String)
String == null
.- Specified by:
extractText
in interfaceI_CmsTextExtractor
- Overrides:
extractText
in classA_CmsTextExtractor
- Parameters:
in
- the input stream for the document to extract the text from- Returns:
- the extracted text and meta information
- Throws:
Exception
- if the text extration fails- See Also:
-