public abstract class StopwordAnalyzerBase extends ReusableAnalyzerBase
Nested classes inherited from class ReusableAnalyzerBase: ReusableAnalyzerBase.TokenStreamComponents
Modifier and Type | Field and Description |
---|---|
protected Version | matchVersion |
protected CharArraySet | stopwords - An immutable stopword set |
Modifier | Constructor and Description |
---|---|
protected | StopwordAnalyzerBase(Version version) - Creates a new Analyzer with an empty stopword set |
protected | StopwordAnalyzerBase(Version version, java.util.Set<?> stopwords) - Creates a new instance initialized with the given stopword set |
Modifier and Type | Method and Description |
---|---|
java.util.Set<?> | getStopwordSet() - Returns the analyzer's stopword set or an empty set if the analyzer has no stopwords |
protected static CharArraySet | loadStopwordSet(boolean ignoreCase, java.lang.Class<? extends ReusableAnalyzerBase> aClass, java.lang.String resource, java.lang.String comment) - Creates a CharArraySet from a file resource associated with a class. |
protected static CharArraySet | loadStopwordSet(java.io.File stopwords, Version matchVersion) - Creates a CharArraySet from a file. |
protected static CharArraySet | loadStopwordSet(java.io.Reader stopwords, Version matchVersion) - Creates a CharArraySet from a reader. |
Methods inherited from class ReusableAnalyzerBase: createComponents, initReader, reusableTokenStream, tokenStream
Methods inherited from class Analyzer: close, getOffsetGap, getPositionIncrementGap, getPreviousTokenStream, setPreviousTokenStream
protected final CharArraySet stopwords
protected final Version matchVersion
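protected StopwordAnalyzerBase(Version version, java.util.Set<?> stopwords)
Creates a new instance initialized with the given stopword set.
Parameters:
version - the Lucene version for cross version compatibility
stopwords - the analyzer's stopword set

protected StopwordAnalyzerBase(Version version)
Creates a new Analyzer with an empty stopword set.
Parameters:
version - the Lucene version for cross version compatibility

public java.util.Set<?> getStopwordSet()
Returns the analyzer's stopword set, or an empty set if the analyzer has no stopwords.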
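
The contract described by the two constructors and getStopwordSet() can be illustrated with a small stand-alone sketch. It uses only java.util and does not reproduce the Lucene implementation: the class names StopwordHolderSketch and EnglishLikeSketch are hypothetical, a plain Set<String> stands in for CharArraySet, and both the defensive copy and the treatment of a null argument as "no stopwords" are assumptions made for illustration.

```java
import java.util.Collections;
import java.util.HashSet;
import java.util.Set;

// Hypothetical stand-in for a stopword-holding base class (not Lucene code).
abstract class StopwordHolderSketch {

    // Mirrors the documented "immutable stopword set" field.
    protected final Set<String> stopwords;

    // Corresponds to the constructor that starts with an empty stopword set.
    protected StopwordHolderSketch() {
        this(null);
    }

    // Corresponds to the constructor that takes a stopword set.
    // Assumption: null means "no stopwords"; the set is copied, then wrapped
    // so later changes to the caller's set do not leak in.
    protected StopwordHolderSketch(Set<String> stopwords) {
        this.stopwords = (stopwords == null)
                ? Collections.<String>emptySet()
                : Collections.unmodifiableSet(new HashSet<>(stopwords));
    }

    // Never returns null: an analyzer without stopwords yields an empty set.
    public Set<String> getStopwordSet() {
        return stopwords;
    }
}

// Hypothetical concrete subclass; a real analyzer would also build its token stream.
final class EnglishLikeSketch extends StopwordHolderSketch {
    EnglishLikeSketch() {
        super();
    }

    EnglishLikeSketch(Set<String> stopwords) {
        super(stopwords);
    }
}
```

In this sketch, callers can always iterate over the result of getStopwordSet() without a null check, and attempts to modify the returned set fail with UnsupportedOperationException.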

protected static CharArraySet loadStopwordSet(boolean ignoreCase, java.lang.Class<? extends ReusableAnalyzerBase> aClass, java.lang.String resource, java.lang.String comment) throws java.io.IOException
Creates a CharArraySet from a file resource associated with a class. (See Class.getResourceAsStream(String).)
Parameters:
ignoreCase - true if the set should ignore the case of the stopwords, otherwise false
aClass - a class that is associated with the given resource
resource - name of the resource file associated with the given class
comment - comment string to ignore in the stopword file
Throws:
java.io.IOException - if loading the stopwords throws an IOException
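
To make the roles of the ignoreCase, resource and comment parameters concrete, here is a minimal stand-alone sketch of loading a stopword list. It is not the Lucene implementation. The class StopwordListSketch and its methods are hypothetical, and the file format is an assumption: one stopword per line, with blank lines and lines starting with the comment string skipped, read as UTF-8. A plain HashSet cannot match case-insensitively the way a CharArraySet can, so the sketch lowercases words on storage instead.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.Reader;
import java.io.StringReader;
import java.nio.charset.StandardCharsets;
import java.util.Collections;
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;

// Hypothetical helper (not Lucene code) that models loading a stopword list.
final class StopwordListSketch {

    private StopwordListSketch() {
    }

    // Assumed format: one word per line; blank lines and lines starting
    // with `comment` are ignored. Closes the reader when done.
    static Set<String> parse(Reader in, String comment, boolean ignoreCase) throws IOException {
        Set<String> words = new HashSet<>();
        try (BufferedReader reader = new BufferedReader(in)) {
            String line;
            while ((line = reader.readLine()) != null) {
                String word = line.trim();
                if (word.isEmpty() || (comment != null && word.startsWith(comment))) {
                    continue;
                }
                // Lowercasing on storage approximates case-insensitive matching;
                // callers must lowercase a token before looking it up.
                words.add(ignoreCase ? word.toLowerCase(Locale.ROOT) : word);
            }
        }
        return Collections.unmodifiableSet(words);
    }

    // Resolves the resource relative to `aClass` via Class.getResourceAsStream(String).
    static Set<String> fromResource(Class<?> aClass, String resource, String comment,
                                    boolean ignoreCase) throws IOException {
        InputStream stream = aClass.getResourceAsStream(resource);
        if (stream == null) {
            throw new IOException("stopword resource not found: " + resource);
        }
        return parse(new InputStreamReader(stream, StandardCharsets.UTF_8), comment, ignoreCase);
    }

    public static void main(String[] args) throws IOException {
        Set<String> words = parse(new StringReader("# articles\nThe\na\n\nan\n"), "#", true);
        System.out.println(words.size());
    }
}
```

Splitting the work into a Reader-based parse method and a thin resource wrapper keeps the parsing logic usable with any character source, which is the same separation the Reader and File overloads of loadStopwordSet suggest.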

protected static CharArraySet loadStopwordSet(java.io.File stopwords, Version matchVersion) throws java.io.IOException
Creates a CharArraySet from a file.
Parameters:
stopwords - the stopwords file to load
matchVersion - the Lucene version for cross version compatibility
Throws:
java.io.IOException - if loading the stopwords throws an IOException

protected static CharArraySet loadStopwordSet(java.io.Reader stopwords, Version matchVersion) throws java.io.IOException
Creates a CharArraySet from a reader.
Parameters:
stopwords - the stopwords reader to load
matchVersion - the Lucene version for cross version compatibility
Throws:
java.io.IOException - if loading the stopwords throws an IOException