public interface ITextExtractor
ITextExtractor
implementation is compatible with a specific list of file
extensions. If multiple implementation are found with compatibility on the same file extension an exception is thrown and extraction
cannot be proceeded.
An ITextExtractor
implementation must be declared using Service Provider Interface
( @see SPI ).
Modifier and Type | Method and Description |
---|---|
void |
extract(BufferedInputStream in,
BufferedOutputStream out)
Extracts the text from the content read from the input stream.
|
Collection<String> |
retrieveCompatibleFileExtensions()
Returns a collection of compatible file extensions
|
Collection<String> retrieveCompatibleFileExtensions()
void extract(BufferedInputStream in, BufferedOutputStream out) throws IOException
This extract method can be called several time on the same ITextExtractor
instance with different content to process
in
- the input stream from which to extract the textout
- the output stream to write the text result toIOException
- a potential IOException
in case of errorCopyright © 2022 Visiativ. All rights reserved.