public final class RussianStemFilter extends TokenFilter
TokenFilter
that stems Russian words.
The implementation was inspired by GermanStemFilter.
The input should be filtered by RussianLowerCaseFilter
before passing it to RussianStemFilter ,
because RussianStemFilter only works with lowercase part of any "russian" charset.
AttributeSource.AttributeFactory, AttributeSource.State
input
Constructor and Description |
---|
RussianStemFilter(TokenStream in) |
RussianStemFilter(TokenStream in,
char[] charset)
Deprecated.
Use
RussianStemFilter(TokenStream) instead. |
Modifier and Type | Method and Description |
---|---|
boolean |
incrementToken()
Returns the next token in the stream, or null at EOS
|
void |
setStemmer(org.apache.lucene.analysis.ru.RussianStemmer stemmer)
Set a alternative/custom
RussianStemmer for this filter. |
close, end, reset
getOnlyUseNewAPI, next, next, setOnlyUseNewAPI
addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, restoreState, toString
public RussianStemFilter(TokenStream in, char[] charset)
RussianStemFilter(TokenStream)
instead.public RussianStemFilter(TokenStream in)
public final boolean incrementToken() throws IOException
incrementToken
in class TokenStream
Note that this method will be defined abstract in Lucene 3.0.
IOException
public void setStemmer(org.apache.lucene.analysis.ru.RussianStemmer stemmer)
RussianStemmer
for this filter.Copyright © 2000-2012 Apache Software Foundation. All Rights Reserved.