Uses of Interface
org.opencms.search.extractors.I_CmsExtractionResult
Package | Description
org.opencms.search
    Implements the main full text search and indexing functions available in OpenCms.
org.opencms.search.documents
    Handles indexing of different sorts of documents and resource types from the OpenCms VFS for the full text search.
org.opencms.search.extractors
    Contains a generic, low-level framework for extraction of plain text content out of various popular file formats.
org.opencms.search.fields
    These classes control the mapping of the OpenCms content to the Lucene search fields.
org.opencms.search.solr
    The package contains the Solr search integration.

Uses of I_CmsExtractionResult in org.opencms.search
Modifier and Type | Method | Description
A_CmsSearchIndex.getContentIfUnchanged(CmsResource resource)
    We always assume we have no unchanged copy of the content, since it depends on the concrete index.
CmsSearchIndex.getContentIfUnchanged(CmsResource resource)
I_CmsSearchIndex.getContentIfUnchanged(CmsResource resource)
    The method should return the extraction result of a content from the index, if it is sure that the content has not changed since the last indexing.

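The contract described for getContentIfUnchanged can be sketched roughly as follows. This is a minimal, self-contained illustration and not OpenCms code: ExtractionSketch, ResourceSketch and IndexSketch are hypothetical stand-ins for I_CmsExtractionResult, CmsResource and an index implementation. Returning null to signal "no reusable copy" and comparing last-modified timestamps to detect changes are both assumptions made for the example.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical stand-in for I_CmsExtractionResult: holds only the extracted text. */
final class ExtractionSketch {
    final String content;

    ExtractionSketch(String content) {
        this.content = content;
    }
}

/** Hypothetical stand-in for CmsResource: a VFS path plus a last-modified timestamp. */
final class ResourceSketch {
    final String path;
    final long dateLastModified;

    ResourceSketch(String path, long dateLastModified) {
        this.path = path;
        this.dateLastModified = dateLastModified;
    }
}

/** Hypothetical index that remembers what it extracted and for which resource version. */
final class IndexSketch {
    private final Map<String, Long> indexedVersion = new HashMap<>();
    private final Map<String, ExtractionSketch> storedContent = new HashMap<>();

    /** Records the extraction together with the timestamp of the indexed resource. */
    void store(ResourceSketch resource, ExtractionSketch extraction) {
        indexedVersion.put(resource.path, resource.dateLastModified);
        storedContent.put(resource.path, extraction);
    }

    /** Returns the stored extraction only if the resource is unchanged, otherwise null (assumption). */
    ExtractionSketch getContentIfUnchanged(ResourceSketch resource) {
        Long version = indexedVersion.get(resource.path);
        if (version != null && version.longValue() == resource.dateLastModified) {
            return storedContent.get(resource.path);
        }
        return null;
    }
}
```

An implementation that cannot tell whether the content changed, as the description of A_CmsSearchIndex suggests, would under these assumptions simply always return null and force a fresh extraction.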
Uses of I_CmsExtractionResult in org.opencms.search.documents
Modifier and Type | Method | Description
CmsDocumentContainerPage.extractContent(CmsObject cms, CmsResource resource, I_CmsSearchIndex index)
    Returns the raw text content of a VFS resource of type CmsResourceTypeContainerPage.
CmsDocumentGeneric.extractContent(CmsObject cms, CmsResource resource, I_CmsSearchIndex index)
    Just returns an empty extraction result since the content can't be extracted from a generic resource.
CmsDocumentHtml.extractContent(CmsObject cms, CmsResource resource, I_CmsSearchIndex index)
    Returns the raw text content of a given VFS resource containing HTML data.
CmsDocumentMsOfficeOLE2.extractContent(CmsObject cms, CmsResource resource, I_CmsSearchIndex index)
    Returns the raw text content of a given VFS resource containing MS Office OLE2 data.
CmsDocumentMsOfficeOOXML.extractContent(CmsObject cms, CmsResource resource, I_CmsSearchIndex index)
    Returns the raw text content of a given VFS resource containing MS Office OOXML data.
CmsDocumentOpenOffice.extractContent(CmsObject cms, CmsResource resource, I_CmsSearchIndex index)
    Returns the raw text content of a given VFS resource containing OpenOffice data.
CmsDocumentPdf.extractContent(CmsObject cms, CmsResource resource, I_CmsSearchIndex index)
    Returns the raw text content of a given VFS resource containing Adobe PDF data.
CmsDocumentPlainText.extractContent(CmsObject cms, CmsResource resource, I_CmsSearchIndex index)
    Returns the raw text content of a given VFS resource containing plain text data.
CmsDocumentRtf.extractContent(CmsObject cms, CmsResource resource, I_CmsSearchIndex index)
    Returns the raw text content of a given VFS resource containing RTF data.
CmsDocumentXmlContent.extractContent(CmsObject cms, CmsResource resource, I_CmsSearchIndex index)
    Returns the raw text content of a given VFS resource of type CmsResourceTypeXmlContent.
CmsDocumentXmlPage.extractContent(CmsObject cms, CmsResource resource, I_CmsSearchIndex index)
    Returns the raw text content of a given VFS resource of type CmsResourceTypeXmlPage.
I_CmsSearchExtractor.extractContent(CmsObject cms, CmsResource resource, I_CmsSearchIndex index)
    Extracts the content of a given index resource according to the resource file type and the configuration of the given index.

Modifier and Type | Method | Description
void CmsExtractionResultCache.saveCacheObject(String rfsName, I_CmsExtractionResult content)
    Serializes the given extraction result and saves it in the disk cache.

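The idea of extracting content "according to the resource file type", with a generic fallback that yields an empty result, might be pictured as a small dispatch table. The sketch below is illustrative only and does not use the OpenCms API: ContentExtractorSketch and ExtractorRegistrySketch are hypothetical names, the type keys "plain" and "html" are invented for the example, and the regular expression is a crude stand-in for a real HTML extractor.

```java
import java.nio.charset.StandardCharsets;
import java.util.HashMap;
import java.util.Map;

/** Hypothetical per-type extractor: turns raw bytes into plain text. */
interface ContentExtractorSketch {
    String extractContent(byte[] rawContent);
}

/** Hypothetical registry that selects an extractor by resource type name. */
final class ExtractorRegistrySketch {
    private final Map<String, ContentExtractorSketch> byType = new HashMap<>();

    // Fallback for unknown types: an empty result, mirroring the generic document type.
    private final ContentExtractorSketch generic = raw -> "";

    ExtractorRegistrySketch() {
        // Plain text: decode the bytes as UTF-8 (assumed encoding).
        byType.put("plain", raw -> new String(raw, StandardCharsets.UTF_8));
        // HTML: strip tags with a regex and collapse whitespace (illustration, not a parser).
        byType.put("html", raw -> new String(raw, StandardCharsets.UTF_8)
            .replaceAll("<[^>]*>", " ")
            .replaceAll("\\s+", " ")
            .trim());
    }

    /** Looks up the extractor for the type name and falls back to the generic one. */
    String extract(String typeName, byte[] rawContent) {
        return byType.getOrDefault(typeName, generic).extractContent(rawContent);
    }
}
```

With this sketch, extracting "<p>Hello <b>world</b></p>" as "html" gives "Hello world", while an unregistered type name gives an empty string.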
Uses of I_CmsExtractionResult in org.opencms.search.extractors
Modifier and Type | Class | Description
class CmsExtractionResult
    The result of a document text extraction.

Modifier and Type | Method | Description
A_CmsTextExtractor.extractText(byte[] content)
A_CmsTextExtractor.extractText(byte[] content, String encoding)
A_CmsTextExtractor.extractText(InputStream in)
A_CmsTextExtractor.extractText(InputStream in, String encoding)
CmsExtractorHtml.extractText(InputStream in, String encoding)
CmsExtractorMsOfficeOLE2.extractText(InputStream in)
CmsExtractorMsOfficeOOXML.extractText(InputStream in)
CmsExtractorOpenOffice.extractText(InputStream in, String encoding)
CmsExtractorPdf.extractText(InputStream in)
CmsExtractorRtf.extractText(InputStream in)
I_CmsTextExtractor.extractText(byte[] content)
    Extracts the text and meta information from the given binary document.
I_CmsTextExtractor.extractText(byte[] content, String encoding)
    Extracts the text and meta information from the given binary document, using the specified content encoding.
I_CmsTextExtractor.extractText(InputStream in)
    Extracts the text and meta information from the document on the input stream.
I_CmsTextExtractor.extractText(InputStream in, String encoding)
    Extracts the text and meta information from the document on the input stream, using the specified content encoding.
CmsExtractionResult.merge(List<I_CmsExtractionResult> extractionResults)
I_CmsExtractionResult.merge(List<I_CmsExtractionResult> extractionResults)
    Appends, for the locales of the current collection result, the content fields from all provided extraction results to the current extraction result.

Modifier and Type | Method | Description
CmsExtractionResult.merge(List<I_CmsExtractionResult> extractionResults)
I_CmsExtractionResult.merge(List<I_CmsExtractionResult> extractionResults)
    Appends, for the locales of the current collection result, the content fields from all provided extraction results to the current extraction result.

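The merge behavior described above (only the locales already present in the current result are extended, and content fields are appended rather than replaced) can be illustrated with a simplified model. ExtractionResultSketch is a hypothetical class, not the OpenCms implementation. Storing content as a map from locale to field name to text, joining appended text with a newline, and returning the current object are all assumptions.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

/** Hypothetical, simplified model of a localized extraction result. */
final class ExtractionResultSketch {
    // locale -> (field name -> extracted text)
    private final Map<Locale, Map<String, String>> fieldsByLocale = new LinkedHashMap<>();

    /** Sets the text of one field for one locale. */
    ExtractionResultSketch put(Locale locale, String field, String text) {
        fieldsByLocale.computeIfAbsent(locale, l -> new LinkedHashMap<>()).put(field, text);
        return this;
    }

    /** Returns the text of a field for a locale, or null if absent. */
    String get(Locale locale, String field) {
        Map<String, String> fields = fieldsByLocale.get(locale);
        return fields == null ? null : fields.get(field);
    }

    /** Appends the fields of the other results, restricted to the locales this result already has. */
    ExtractionResultSketch merge(List<ExtractionResultSketch> others) {
        for (Map.Entry<Locale, Map<String, String>> own : fieldsByLocale.entrySet()) {
            for (ExtractionResultSketch other : others) {
                Map<String, String> otherFields = other.fieldsByLocale.get(own.getKey());
                if (otherFields == null) {
                    continue; // the other result has nothing for this locale
                }
                for (Map.Entry<String, String> field : otherFields.entrySet()) {
                    // Append to an existing field, or add the field if it is new.
                    own.getValue().merge(field.getKey(), field.getValue(), (a, b) -> a + "\n" + b);
                }
            }
        }
        return this;
    }
}
```

In this model, merging an element that has English and German content into a page that has only English content extends the English field and leaves the page without a German entry.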
Uses of I_CmsExtractionResult in org.opencms.search.fields
Modifier and Type | Method | Description
protected I_CmsSearchDocument CmsSearchFieldConfiguration.appendAdditionalValuesToDcoument(I_CmsSearchDocument document, CmsObject cms, CmsResource resource, I_CmsExtractionResult extraction, List<CmsProperty> properties, List<CmsProperty> propertiesSearched)
    Overriding this method allows appending some 'extra' values/fields to a document without overriding the CmsSearchFieldConfiguration.createDocument(org.opencms.file.CmsObject, org.opencms.file.CmsResource, org.opencms.search.I_CmsSearchIndex, org.opencms.search.extractors.I_CmsExtractionResult) method itself.
protected I_CmsSearchDocument CmsSearchFieldConfiguration.appendCategories(I_CmsSearchDocument document, CmsObject cms, CmsResource resource, I_CmsExtractionResult extractionResult, List<CmsProperty> properties, List<CmsProperty> propertiesSearched)
    Extends the given document by resource category information based on properties.
protected I_CmsSearchDocument CmsSearchFieldConfigurationOldCategories.appendCategories(I_CmsSearchDocument document, CmsObject cms, CmsResource resource, I_CmsExtractionResult extractionResult, List<CmsProperty> properties, List<CmsProperty> propertiesSearched)
    Extends the given document by resource category information based on properties.
protected I_CmsSearchDocument CmsSearchFieldConfiguration.appendContentBlob(I_CmsSearchDocument document, CmsObject cms, CmsResource resource, I_CmsExtractionResult extractionResult, List<CmsProperty> properties, List<CmsProperty> propertiesSearched)
    Extends the given document by a field that contains the extracted content blob.
protected I_CmsSearchDocument CmsSearchFieldConfiguration.appendDates(I_CmsSearchDocument document, CmsObject cms, CmsResource resource, I_CmsExtractionResult extractionResult, List<CmsProperty> properties, List<CmsProperty> propertiesSearched)
    Extends the given document by fields for date of creation, content and last modification.
protected I_CmsSearchDocument CmsSearchFieldConfiguration.appendFieldMapping(I_CmsSearchDocument document, CmsSearchField field, CmsObject cms, CmsResource resource, I_CmsExtractionResult extractionResult, List<CmsProperty> properties, List<CmsProperty> propertiesSearched)
    Extends the given document by the mappings for the given field.
protected I_CmsSearchDocument CmsSearchFieldConfiguration.appendFieldMappings(I_CmsSearchDocument document, CmsObject cms, CmsResource resource, I_CmsExtractionResult extractionResult, List<CmsProperty> properties, List<CmsProperty> propertiesSearched)
    Extends the given document by the configured field mappings.
protected I_CmsSearchDocument CmsSearchFieldConfiguration.appendFileSize(I_CmsSearchDocument document, CmsObject cms, CmsResource resource, I_CmsExtractionResult extractionResult, List<CmsProperty> properties, List<CmsProperty> propertiesSearched)
    Extends the given document by the "size" field.
protected I_CmsSearchDocument CmsSearchFieldConfiguration.appendLocales(I_CmsSearchDocument document, CmsObject cms, CmsResource resource, I_CmsExtractionResult extraction, List<CmsProperty> properties, List<CmsProperty> propertiesSearched)
    Extends the given document by the "res_locales" field.
protected I_CmsSearchDocument CmsSearchFieldConfiguration.appendPath(I_CmsSearchDocument document, CmsObject cms, CmsResource resource, I_CmsExtractionResult extractionResult, List<CmsProperty> properties, List<CmsProperty> propertiesSearched)
    Extends the given document by fields for VFS path lookup.
protected I_CmsSearchDocument CmsSearchFieldConfiguration.appendProperties(I_CmsSearchDocument document, CmsObject cms, CmsResource resource, I_CmsExtractionResult extraction, List<CmsProperty> properties, List<CmsProperty> propertiesSearched)
    Appends all direct properties that are not empty or whitespace-only to the document.
protected I_CmsSearchDocument CmsSearchFieldConfiguration.appendType(I_CmsSearchDocument document, CmsObject cms, CmsResource resource, I_CmsExtractionResult extractionResult, List<CmsProperty> properties, List<CmsProperty> propertiesSearched)
    Extends the given document by a field that contains the resource type name.
CmsSearchFieldConfiguration.createDocument(CmsObject cms, CmsResource resource, I_CmsSearchIndex index, I_CmsExtractionResult extraction)
    Creates the Lucene Document with this field configuration for the provided VFS resource, search index and content.
I_CmsSearchFieldConfiguration.createDocument(CmsObject cms, CmsResource resource, I_CmsSearchIndex index, I_CmsExtractionResult extractionResult)
    Creates the document to index.
CmsGeoCoordinateFieldMapping.getStringValue(int depth, CmsGeoMappingConfiguration mappingConfig, CmsObject cms, CmsResource res, I_CmsExtractionResult extractionResult)
    Gets the mapped value.
CmsGeoCoordinateFieldMapping.getStringValue(CmsObject cms, CmsResource res, I_CmsExtractionResult extractionResult, List<CmsProperty> properties, List<CmsProperty> propertiesSearched)
CmsMacroResolvingSearchFieldMapping.getStringValue(CmsObject cms, CmsResource res, I_CmsExtractionResult extractionResult, List<CmsProperty> properties, List<CmsProperty> propertiesSearched)
    Calls the super method and resolves macros in the returned value.
CmsMacroSearchFieldMapping.getStringValue(CmsObject cms, CmsResource res, I_CmsExtractionResult extractionResult, List<CmsProperty> properties, List<CmsProperty> propertiesSearched)
CmsSearchFieldMapping.getStringValue(CmsObject cms, CmsResource res, I_CmsExtractionResult extractionResult, List<CmsProperty> properties, List<CmsProperty> propertiesSearched)
I_CmsSearchFieldMapping.getStringValue(CmsObject cms, CmsResource res, I_CmsExtractionResult extractionResult, List<CmsProperty> properties, List<CmsProperty> propertiesSearched)
    Returns the String value extracted from the provided data according to the rules of this mapping type.

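The relationship between createDocument and the protected append methods, in particular the hook that lets a subclass add extra fields without overriding createDocument itself, resembles the template method pattern. The following sketch shows that pattern with hypothetical classes and field names ("path", "type", "content", "wordCount"). It does not reproduce the OpenCms signatures: a plain map stands in for the search document, and the hooks return void instead of the document, both of which are simplifying assumptions.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/** Hypothetical field configuration: createDocument drives a fixed sequence of append steps. */
class FieldConfigurationSketch {

    /** Builds the document; final so that subclasses customize the hooks, not the sequence. */
    final Map<String, String> createDocument(String path, String typeName, String extractedText) {
        Map<String, String> document = new LinkedHashMap<>();
        appendPath(document, path);
        appendType(document, typeName);
        appendContent(document, extractedText);
        appendAdditionalValues(document, path, extractedText);
        return document;
    }

    protected void appendPath(Map<String, String> document, String path) {
        document.put("path", path);
    }

    protected void appendType(Map<String, String> document, String typeName) {
        document.put("type", typeName);
    }

    protected void appendContent(Map<String, String> document, String extractedText) {
        document.put("content", extractedText);
    }

    /** Extension hook: does nothing by default. */
    protected void appendAdditionalValues(Map<String, String> document, String path, String extractedText) {
        // intentionally empty
    }
}

/** Hypothetical subclass that adds one extra field through the hook only. */
class WordCountConfigurationSketch extends FieldConfigurationSketch {
    @Override
    protected void appendAdditionalValues(Map<String, String> document, String path, String extractedText) {
        String trimmed = extractedText.trim();
        int words = trimmed.isEmpty() ? 0 : trimmed.split("\\s+").length;
        document.put("wordCount", Integer.toString(words));
    }
}
```

The subclass inherits the standard fields unchanged and contributes only its own field, which is the benefit the description of the hook method points at.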
Uses of I_CmsExtractionResult in org.opencms.search.solr
Modifier and Type | Method | Description
CmsSolrDocumentContainerPage.extractContent(CmsObject cms, CmsResource resource, I_CmsSearchIndex index)
    Returns the raw text content of a VFS resource of type CmsResourceTypeContainerPage.
CmsSolrDocumentContainerPage.extractContent(CmsObject cms, CmsResource resource, I_CmsSearchIndex index, Locale forceLocale)
    Extracts the content of a given index resource according to the resource file type and the configuration of the given index.
CmsSolrDocumentXmlContent.extractContent(CmsObject cms, CmsResource resource, I_CmsSearchIndex index)

Modifier and Type | Method | Description
protected I_CmsSearchDocument CmsSolrFieldConfiguration.appendAdditionalValuesToDcoument(I_CmsSearchDocument document, CmsObject cms, CmsResource resource, I_CmsExtractionResult extractionResult, List<CmsProperty> properties, List<CmsProperty> propertiesSearched)
protected I_CmsSearchDocument CmsSolrFieldConfiguration.appendDates(I_CmsSearchDocument document, CmsObject cms, CmsResource resource, I_CmsExtractionResult extractionResult, List<CmsProperty> properties, List<CmsProperty> propertiesSearched)
protected I_CmsSearchDocument CmsSolrFieldConfiguration.appendFieldMapping(I_CmsSearchDocument document, CmsSearchField sfield, CmsObject cms, CmsResource resource, I_CmsExtractionResult extractionResult, List<CmsProperty> properties, List<CmsProperty> propertiesSearched)
protected I_CmsSearchDocument CmsSolrFieldConfiguration.appendFieldMappings(I_CmsSearchDocument document, CmsObject cms, CmsResource resource, I_CmsExtractionResult extractionResult, List<CmsProperty> properties, List<CmsProperty> propertiesSearched)
protected I_CmsSearchDocument CmsSolrFieldConfiguration.appendLocales(I_CmsSearchDocument document, CmsObject cms, CmsResource resource, I_CmsExtractionResult extraction, List<CmsProperty> properties, List<CmsProperty> propertiesSearched)
protected I_CmsSearchDocument CmsSolrFieldConfiguration.appendProperties(I_CmsSearchDocument document, CmsObject cms, CmsResource resource, I_CmsExtractionResult extraction, List<CmsProperty> properties, List<CmsProperty> propertiesSearched)
CmsSolrFieldConfiguration.getContentLocales(CmsObject cms, CmsResource resource, I_CmsExtractionResult extraction)
    Retrieves the locales for a content that is neither an XML content nor an XML page.