public class OpenDocumentDocumentFactory extends AbstractSimpleTikaDocumentFactory
The only metadata that will be parsed is GreedyTikaField.NAME.
PropertyBasedDocumentFactory.MetadataKeysDocumentFactory.FieldTypedefaultMetadata| Constructor and Description |
|---|
OpenDocumentDocumentFactory() |
OpenDocumentDocumentFactory(Properties properties) |
OpenDocumentDocumentFactory(Reference2ObjectMap<Enum<?>,Object> defaultMetadata) |
OpenDocumentDocumentFactory(String[] property) |
| Modifier and Type | Method and Description |
|---|---|
protected org.apache.tika.parser.Parser |
getParser()
The parser to be used to parse this kind of documents; subclasses should return always the same instance, as Tika parsers are immutable and thread-safe.
|
protected List<? extends TikaField> |
metadataFields()
The list of Tika fields (apart for content) that this factory provides; it returns the empty list, so most subclasses may want to override this method.
|
copy, fields, getDocument, parsePropertyfieldIndex, fieldName, fieldType, numberOfFieldsensureJustOne, getInstance, getInstance, getInstance, getInstance, parseProperties, parseProperties, resolve, resolve, resolveNotNull, sameKeyensureFieldIndex, toStringpublic OpenDocumentDocumentFactory()
public OpenDocumentDocumentFactory(Reference2ObjectMap<Enum<?>,Object> defaultMetadata)
public OpenDocumentDocumentFactory(Properties properties) throws ConfigurationException
ConfigurationExceptionpublic OpenDocumentDocumentFactory(String[] property) throws ConfigurationException
ConfigurationExceptionprotected List<? extends TikaField> metadataFields()
AbstractSimpleTikaDocumentFactorymetadataFields in class AbstractSimpleTikaDocumentFactoryprotected org.apache.tika.parser.Parser getParser()
AbstractSimpleTikaDocumentFactorygetParser in class AbstractSimpleTikaDocumentFactory