org.apache.lucene.analysis.ru
Class RussianAnalyzer
public final class RussianAnalyzer
Analyzer for Russian language. Supports an external list of stopwords (words that
will not be indexed at all).
A default set of stopwords is used unless an alternative list is specified.
$Id: RussianAnalyzer.java 150998 2004-08-16 20:30:46Z dnaber $- Boris Okner, b.okner@rogers.com
TokenStream | tokenStream(String fieldName, Reader reader) - Creates a TokenStream which tokenizes all the text in the provided Reader.
|
RussianAnalyzer
public RussianAnalyzer()
RussianAnalyzer
public RussianAnalyzer(char[] charset)
Builds an analyzer.
RussianAnalyzer
public RussianAnalyzer(char[] charset,
Hashtable stopwords)
Builds an analyzer with the given stop words.
- create a Set version of this ctor
RussianAnalyzer
public RussianAnalyzer(char[] charset,
String[] stopwords)
Builds an analyzer with the given stop words.
tokenStream
public TokenStream tokenStream(String fieldName,
Reader reader)
Creates a TokenStream which tokenizes all the text in the provided Reader.
- tokenStream in interface Analyzer
- A TokenStream build from a RussianLetterTokenizer filtered with
RussianLowerCaseFilter, StopFilter, and RussianStemFilter
Copyright © 2000-2007 Apache Software Foundation. All Rights Reserved.