public class TikaInputTransformer extends Object implements InputTransformer
Constructor and Description |
---|
TikaInputTransformer(org.osgi.framework.BundleContext bundleContext, MetacardType metacardType) |
Modifier and Type | Method and Description |
---|---|
void | addContentMetadataExtractor(org.osgi.framework.ServiceReference<ContentMetadataExtractor> contentMetadataExtractorRef) |
void | addMetadataExtractor(org.osgi.framework.ServiceReference<MetadataExtractor> metadataExtractorRef) |
protected void | enrichMetacard(TemporaryFileBackedOutputStream fileBackedOutputStream, String metacardContentType, long bytes, Metacard metacard) |
Optional<MetacardType> | getMetacardTypeFromMimeType(String mimeType): Determines which MetacardType should be used to create a metacard for an input file of a given mime type. |
int | getPreviewMaxLength() |
protected MetacardType | mergeAttributes(MetacardType metacardType) |
void | populateMimeTypeMap(): Populates the mimeTypeToMetacardMap for use in determining the MetacardType that corresponds to an ingested product's mimeType. |
void | removeContentMetadataExtractor(org.osgi.framework.ServiceReference<ContentMetadataExtractor> contentMetadataExtractorRef) |
void | removeMetadataExtractor(org.osgi.framework.ServiceReference<MetadataExtractor> metadataExtractorRef) |
void | setCommonTikaMetacardType(MetacardType metacardType) |
void | setFallbackExcelMetacardType(MetacardType metacardType) |
void | setFallbackJpegMetacardType(MetacardType metacardType) |
void | setFallbackMp4MetacardType(MetacardType metacardType) |
void | setFallbackMpegMetacardType(MetacardType metacardType) |
void | setFallbackOfficeDocMetacardType(MetacardType metacardType) |
void | setFallbackPdfMetacardType(MetacardType metacardType) |
void | setFallbackPowerpointMetacardType(MetacardType metacardType) |
void | setMetadataMaxLength(int metadataMaxLength) |
void | setPreviewMaxLength(int previewMaxLength) |
void | setUseResourceTitleAsTitle(Boolean useResourceTitleAsTitle) |
Metacard | transform(InputStream input): Transforms the input into a Metacard. |
Metacard | transform(InputStream input, String id) |
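
The two transform overloads in the summary above both take an InputStream and declare IOException and CatalogTransformerException. The following is a minimal calling-pattern sketch, not the class's actual implementation. To stay runnable without the catalog libraries it uses hypothetical stand-in types (SketchMetacard, SketchTransformerException, SketchInputTransformer) and a toy transformer whose behaviour (rejecting empty input, echoing the id) is an assumption made only for illustration.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;

final class TransformUsageSketch {

    // Hypothetical stand-ins so the sketch compiles without the catalog
    // libraries; they only mirror the shapes of Metacard,
    // CatalogTransformerException and InputTransformer.
    interface SketchMetacard {
        String getId();
    }

    static class SketchTransformerException extends Exception {
        SketchTransformerException(String message) {
            super(message);
        }
    }

    interface SketchInputTransformer {
        SketchMetacard transform(InputStream input)
                throws IOException, SketchTransformerException;

        SketchMetacard transform(InputStream input, String id)
                throws IOException, SketchTransformerException;
    }

    /** Calls the two-argument overload and always closes the stream. */
    static SketchMetacard ingest(SketchInputTransformer transformer, byte[] content, String id)
            throws IOException, SketchTransformerException {
        try (InputStream input = new ByteArrayInputStream(content)) {
            return transformer.transform(input, id);
        }
    }

    /** Toy transformer (an assumption for this sketch): rejects empty input, echoes the id. */
    static SketchInputTransformer toyTransformer() {
        return new SketchInputTransformer() {
            @Override
            public SketchMetacard transform(InputStream input)
                    throws IOException, SketchTransformerException {
                return transform(input, "generated-id");
            }

            @Override
            public SketchMetacard transform(InputStream input, String id)
                    throws IOException, SketchTransformerException {
                if (input.read() == -1) {
                    throw new SketchTransformerException("empty input");
                }
                return () -> id;
            }
        };
    }

    public static void main(String[] args) {
        byte[] content = "hello".getBytes(StandardCharsets.UTF_8);
        try {
            SketchMetacard metacard = ingest(toyTransformer(), content, "abc-123");
            System.out.println("created metacard " + metacard.getId());
        } catch (IOException e) {
            System.err.println("could not read the input: " + e.getMessage());
        } catch (SketchTransformerException e) {
            System.err.println("transformation failed: " + e.getMessage());
        }
    }
}
```

Handling the two checked exceptions separately lets a caller distinguish a stream that could not be read from content that could not be transformed. With the real class, the stand-in types would be replaced by the catalog's own Metacard, InputTransformer and CatalogTransformerException.
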
public TikaInputTransformer(org.osgi.framework.BundleContext bundleContext, MetacardType metacardType)
public int getPreviewMaxLength()
public void setPreviewMaxLength(int previewMaxLength)
public void setMetadataMaxLength(int metadataMaxLength)
public void setCommonTikaMetacardType(MetacardType metacardType)
public void setFallbackExcelMetacardType(MetacardType metacardType)
public void setFallbackJpegMetacardType(MetacardType metacardType)
public void setFallbackMp4MetacardType(MetacardType metacardType)
public void setFallbackMpegMetacardType(MetacardType metacardType)
public void setFallbackOfficeDocMetacardType(MetacardType metacardType)
public void setFallbackPdfMetacardType(MetacardType metacardType)
public void setFallbackPowerpointMetacardType(MetacardType metacardType)
public void populateMimeTypeMap()
Populates the mimeTypeToMetacardMap for use in determining the MetacardType that corresponds to an ingested product's mimeType.
public Optional<MetacardType> getMetacardTypeFromMimeType(String mimeType)
Determines which MetacardType should be used to create a metacard for an input file of a given mime type.
Parameters:
mimeType - the String representing the mime type of the file
Returns:
Optional of MetacardType that should be used to create a Metacard for the given mimeType. Returns empty Optional if no MetacardType matched the given mime type.
public Metacard transform(InputStream input) throws IOException, CatalogTransformerException
Transforms the input into a Metacard.
Specified by:
transform in interface InputTransformer
Parameters:
input - the binary InputStream to transform
Returns:
Metacard
Throws:
IOException - if an I/O exception occurs when reading the InputStream
CatalogTransformerException - if an error occurs during transformation
public Metacard transform(InputStream input, String id) throws IOException, CatalogTransformerException
Specified by:
transform in interface InputTransformer
Parameters:
input - the binary InputStream to transform
id - the attribute value for the Metacard.ID attribute that should be set in the generated Metacard
Returns:
Metacard
Throws:
IOException - if an I/O exception occurs when reading the InputStream
CatalogTransformerException - if an error occurs during transformation
public void addContentMetadataExtractor(org.osgi.framework.ServiceReference<ContentMetadataExtractor> contentMetadataExtractorRef)
public void addMetadataExtractor(org.osgi.framework.ServiceReference<MetadataExtractor> metadataExtractorRef)
public void removeContentMetadataExtractor(org.osgi.framework.ServiceReference<ContentMetadataExtractor> contentMetadataExtractorRef)
public void removeMetadataExtractor(org.osgi.framework.ServiceReference<MetadataExtractor> metadataExtractorRef)
public void setUseResourceTitleAsTitle(Boolean useResourceTitleAsTitle)
Parameters:
useResourceTitleAsTitle - must be non-null
protected MetacardType mergeAttributes(MetacardType metacardType)
protected void enrichMetacard(TemporaryFileBackedOutputStream fileBackedOutputStream, String metacardContentType, long bytes, Metacard metacard) throws IOException
Throws:
IOException
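
The descriptions of populateMimeTypeMap and getMetacardTypeFromMimeType above imply a lookup table keyed by mime type that yields an empty Optional when nothing matches. The sketch below illustrates that contract only; it is not the class's source. SketchMetacardType is a hypothetical stand-in for MetacardType, and the mime types and type names placed in the map are assumptions chosen for the example.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.Optional;

final class MimeTypeLookupSketch {

    // Hypothetical stand-in for MetacardType; only the name matters here.
    static final class SketchMetacardType {
        private final String name;

        SketchMetacardType(String name) {
            this.name = name;
        }

        String getName() {
            return name;
        }
    }

    private final Map<String, SketchMetacardType> mimeTypeToMetacardMap = new HashMap<>();

    /** Rebuilds the lookup table; the entries are illustrative assumptions. */
    void populateMimeTypeMap() {
        mimeTypeToMetacardMap.clear();
        mimeTypeToMetacardMap.put("application/pdf", new SketchMetacardType("fallback.pdf"));
        mimeTypeToMetacardMap.put("image/jpeg", new SketchMetacardType("fallback.jpeg"));
        mimeTypeToMetacardMap.put("video/mp4", new SketchMetacardType("fallback.mp4"));
    }

    /** Returns the mapped type, or an empty Optional when no entry matches. */
    Optional<SketchMetacardType> getMetacardTypeFromMimeType(String mimeType) {
        if (mimeType == null || mimeType.trim().isEmpty()) {
            return Optional.empty();
        }
        return Optional.ofNullable(mimeTypeToMetacardMap.get(mimeType));
    }

    public static void main(String[] args) {
        MimeTypeLookupSketch lookup = new MimeTypeLookupSketch();
        lookup.populateMimeTypeMap();

        // A caller can fall back to a default name when the Optional is empty.
        String pdfType = lookup.getMetacardTypeFromMimeType("application/pdf")
                .map(SketchMetacardType::getName)
                .orElse("common");
        String textType = lookup.getMetacardTypeFromMimeType("text/plain")
                .map(SketchMetacardType::getName)
                .orElse("common");

        System.out.println(pdfType);  // prints "fallback.pdf"
        System.out.println(textType); // prints "common"
    }
}
```

Returning Optional rather than null makes the "no match" case explicit at the call site, so the caller decides what default to apply.
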
This work is licensed under a Creative Commons Attribution 4.0 International License.