org.apache.slide.extractor
Class AbstractContentExtractor
java.lang.Object
org.apache.slide.extractor.AbstractContentExtractor
- All Implemented Interfaces:
- ContentExtractor, Extractor
- Direct Known Subclasses:
- MSExcelExtractor, MSPowerPointExtractor, MSWordExtractor, PDFExtractor
- public abstract class AbstractContentExtractor
- extends java.lang.Object
- implements ContentExtractor
The AbstractPropertyExtractor class
Method Summary |
abstract java.io.Reader |
extract(java.io.InputStream content)
Will be called before content gets stored |
java.lang.String |
getContentType()
|
java.lang.String |
getNamespace()
|
java.lang.String |
getUri()
|
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
AbstractContentExtractor
public AbstractContentExtractor(java.lang.String uri,
java.lang.String contentType)
AbstractContentExtractor
public AbstractContentExtractor(java.lang.String uri,
java.lang.String contentType,
java.lang.String namespace)
extract
public abstract java.io.Reader extract(java.io.InputStream content)
throws ExtractorException
- Description copied from interface:
ContentExtractor
- Will be called before content gets stored
- Specified by:
extract
in interface ContentExtractor
- Throws:
ExtractorException
getContentType
public java.lang.String getContentType()
- Specified by:
getContentType
in interface Extractor
getUri
public java.lang.String getUri()
- Specified by:
getUri
in interface Extractor
getNamespace
public java.lang.String getNamespace()
- Specified by:
getNamespace
in interface Extractor