public class JapaneseAnalyzer extends StopwordAnalyzerBase
JapaneseTokenizer
ReusableAnalyzerBase.TokenStreamComponents
matchVersion, stopwords
Constructor and Description |
---|
JapaneseAnalyzer(Version matchVersion) |
JapaneseAnalyzer(Version matchVersion,
UserDictionary userDict,
JapaneseTokenizer.Mode mode,
CharArraySet stopwords,
java.util.Set<java.lang.String> stoptags) |
Modifier and Type | Method and Description |
---|---|
protected ReusableAnalyzerBase.TokenStreamComponents |
createComponents(java.lang.String fieldName,
java.io.Reader reader)
Creates a new
ReusableAnalyzerBase.TokenStreamComponents instance for this analyzer. |
static CharArraySet |
getDefaultStopSet() |
static java.util.Set<java.lang.String> |
getDefaultStopTags() |
getStopwordSet, loadStopwordSet, loadStopwordSet, loadStopwordSet
initReader, reusableTokenStream, tokenStream
close, getOffsetGap, getPositionIncrementGap, getPreviousTokenStream, setPreviousTokenStream
public JapaneseAnalyzer(Version matchVersion)
public JapaneseAnalyzer(Version matchVersion, UserDictionary userDict, JapaneseTokenizer.Mode mode, CharArraySet stopwords, java.util.Set<java.lang.String> stoptags)
public static CharArraySet getDefaultStopSet()
public static java.util.Set<java.lang.String> getDefaultStopTags()
protected ReusableAnalyzerBase.TokenStreamComponents createComponents(java.lang.String fieldName, java.io.Reader reader)
ReusableAnalyzerBase
ReusableAnalyzerBase.TokenStreamComponents
instance for this analyzer.createComponents
in class ReusableAnalyzerBase
fieldName
- the name of the fields content passed to the
ReusableAnalyzerBase.TokenStreamComponents
sink as a readerreader
- the reader passed to the Tokenizer
constructorReusableAnalyzerBase.TokenStreamComponents
for this analyzer.