org.apache.solr.analysis
Class HTMLStripCharFilter
java.lang.Object
java.io.Reader
org.apache.lucene.analysis.CharStream
org.apache.lucene.analysis.CharFilter
org.apache.lucene.analysis.BaseCharFilter
org.apache.solr.analysis.HTMLStripCharFilter
- All Implemented Interfaces:
- Closeable, Readable
- Direct Known Subclasses:
- HTMLStripReader
public class HTMLStripCharFilter
- extends BaseCharFilter
A CharFilter that wraps another Reader and attempts to strip out HTML constructs.
- Version:
- $Id: HTMLStripCharFilter.java 826299 2009-10-17 19:56:01Z yonik $
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
DEFAULT_READ_AHEAD
public static final int DEFAULT_READ_AHEAD
- See Also:
- Constant Field Values
HTMLStripCharFilter
public HTMLStripCharFilter(CharStream source)
HTMLStripCharFilter
public HTMLStripCharFilter(CharStream source,
Set<String> escapedTags)
HTMLStripCharFilter
public HTMLStripCharFilter(CharStream source,
Set<String> escapedTags,
int readAheadLimit)
main
public static void main(String[] args)
throws IOException
- Throws:
IOException
getReadAheadLimit
public int getReadAheadLimit()
read
public int read()
throws IOException
- Overrides:
read
in class Reader
- Throws:
IOException
read
public int read(char[] cbuf,
int off,
int len)
throws IOException
- Overrides:
read
in class CharFilter
- Throws:
IOException
close
public void close()
throws IOException
- Specified by:
close
in interface Closeable
- Overrides:
close
in class CharFilter
- Throws:
IOException
Copyright © 2011 Apache Software Foundation. All Rights Reserved.