public interface ITextExtractorProcessor
Extraction on one or more file extensions can be disabled using the configuration key "index.system.extractor.text.disable.document.types" by listing the disabled file extensions separated by commas.
| Modifier and Type | Method and Description |
|---|---|
boolean |
canHandleExtractionFor(String fileExtension)
Indicates if a file extension can be processed to extract text from
|
void |
extract(BufferedInputStream in,
BufferedOutputStream out,
String fileExtension)
Extracts the text from the content read in the input stream.
|
void extract(BufferedInputStream in, BufferedOutputStream out, String fileExtension) throws ExtractorNotFoundException, IOException
in - the input stream from which to extract the textout - the output stream to write the text result tofileExtension - the file extension indicating the type of contentExtractorNotFoundException - if no suitable extractor can be used for the type of contentIOException - a potential IOException in case of errorboolean canHandleExtractionFor(String fileExtension)
fileExtension - the file extensionCopyright © 2022 Visiativ. All rights reserved.