public interface ITextExtractorProcessor
Extraction on one or more file extensions can be disabled using the configuration key "index.system.extractor.text.disable.document.types" by listing the disabled file extensions separated by commas.
Modifier and Type | Method and Description |
---|---|
boolean |
canHandleExtractionFor(String fileExtension)
Indicates if a file extension can be processed to extract text from
|
void |
extract(BufferedInputStream in,
BufferedOutputStream out,
String fileExtension)
Extracts the text from the content read in the input stream.
|
void extract(BufferedInputStream in, BufferedOutputStream out, String fileExtension) throws ExtractorNotFoundException, IOException
in
- the input stream from which to extract the textout
- the output stream to write the text result tofileExtension
- the file extension indicating the type of contentExtractorNotFoundException
- if no suitable extractor can be used for the type of contentIOException
- a potential IOException
in case of errorboolean canHandleExtractionFor(String fileExtension)
fileExtension
- the file extensionCopyright © 2022 Visiativ. All rights reserved.