org.apache.lucene.analysis.fr
Class FrenchAnalyzer
public final class FrenchAnalyzer
Analyzer for French language. Supports an external list of stopwords (words that
will not be indexed at all) and an external list of exclusions (word that will
not be stemmed, but indexed).
A default set of stopwords is used unless an alternative list is specified, the
exclusion list is empty by default.
$Id: FrenchAnalyzer.java 178832 2005-05-27 23:00:49Z dnaber $- Patrick Talbot (based on Gerhard Schwarz's work for German)
void | setStemExclusionTable(File exclusionlist) - Builds an exclusionlist from the words contained in the given file.
|
void | setStemExclusionTable(Hashtable exclusionlist) - Builds an exclusionlist from a Hashtable.
|
void | setStemExclusionTable(String[] exclusionlist) - Builds an exclusionlist from an array of Strings.
|
TokenStream | tokenStream(String fieldName, Reader reader) - Creates a TokenStream which tokenizes all the text in the provided Reader.
|
FRENCH_STOP_WORDS
public static final String[] FRENCH_STOP_WORDS
Extended list of typical French stopwords.
FrenchAnalyzer
public FrenchAnalyzer()
FrenchAnalyzer
public FrenchAnalyzer(File stopwords)
throws IOException
Builds an analyzer with the given stop words.
FrenchAnalyzer
public FrenchAnalyzer(Hashtable stopwords)
Builds an analyzer with the given stop words.
FrenchAnalyzer
public FrenchAnalyzer(String[] stopwords)
Builds an analyzer with the given stop words.
setStemExclusionTable
public void setStemExclusionTable(File exclusionlist)
throws IOException
Builds an exclusionlist from the words contained in the given file.
setStemExclusionTable
public void setStemExclusionTable(Hashtable exclusionlist)
Builds an exclusionlist from a Hashtable.
setStemExclusionTable
public void setStemExclusionTable(String[] exclusionlist)
Builds an exclusionlist from an array of Strings.
tokenStream
public final TokenStream tokenStream(String fieldName,
Reader reader)
Creates a TokenStream which tokenizes all the text in the provided Reader.
- tokenStream in interface Analyzer
- A TokenStream build from a StandardTokenizer filtered with
StandardFilter, StopFilter, FrenchStemFilter and LowerCaseFilter
Copyright © 2000-2007 Apache Software Foundation. All Rights Reserved.