Package | Description |
---|---|
org.apache.lucene.analysis | API and code to convert text into indexable tokens. |
org.apache.lucene.analysis.br | Analyzer for Brazilian Portuguese. |
org.apache.lucene.analysis.cn | Analyzer for Chinese. |
org.apache.lucene.analysis.de | Analyzer for German. |
org.apache.lucene.analysis.el | Analyzer for Greek. |
org.apache.lucene.analysis.fr | Analyzer for French. |
org.apache.lucene.analysis.nl | Analyzer for Dutch. |
org.apache.lucene.analysis.ru | Analyzer for Russian. |
org.apache.lucene.analysis.snowball | TokenFilter and Analyzer implementations that use Snowball stemmers. |
org.apache.lucene.analysis.standard | A grammar-based tokenizer constructed with JavaCC. |
org.apache.lucene.index.memory | High-performance single-document main memory Apache Lucene fulltext search index. |
Modifier and Type | Class and Description |
---|---|
class | ISOLatin1AccentFilter: A filter that replaces accented characters in the ISO Latin 1 character set (ISO-8859-1) by their unaccented equivalent. |
class | LengthFilter: Removes words that are too long or too short from the stream. |
class | LowerCaseFilter: Normalizes token text to lower case. |
class | PorterStemFilter: Transforms the token stream as per the Porter stemming algorithm. |
class | StopFilter: Removes stop words from a token stream. |
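
The filters in this table each take a token stream and pass on a modified one, so they can be stacked. The following is a minimal stand-alone sketch of that idea using plain Java collections rather than Lucene classes. The class and method names (`FilterChainSketch`, `lowerCase`, `removeStopWords`, `keepByLength`), the whitespace split, the stop word list, and the length bounds are all assumptions made for illustration and are not part of the Lucene API.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

/** Illustrative only: mimics the effect of chained token filters on a list of strings. */
class FilterChainSketch {

    /** Lower-cases every token (the effect described for LowerCaseFilter). */
    static List<String> lowerCase(List<String> tokens) {
        List<String> out = new ArrayList<>();
        for (String token : tokens) {
            out.add(token.toLowerCase(Locale.ROOT));
        }
        return out;
    }

    /** Drops tokens contained in stopWords (the effect described for StopFilter). */
    static List<String> removeStopWords(List<String> tokens, Set<String> stopWords) {
        List<String> out = new ArrayList<>();
        for (String token : tokens) {
            if (!stopWords.contains(token)) {
                out.add(token);
            }
        }
        return out;
    }

    /** Keeps tokens whose length lies in [min, max] (the effect described for LengthFilter). */
    static List<String> keepByLength(List<String> tokens, int min, int max) {
        List<String> out = new ArrayList<>();
        for (String token : tokens) {
            if (token.length() >= min && token.length() <= max) {
                out.add(token);
            }
        }
        return out;
    }

    /** Splits on whitespace (an assumption), then applies the three steps in order. */
    static List<String> analyze(String text) {
        List<String> tokens = Arrays.asList(text.trim().split("\\s+"));
        Set<String> stopWords = new HashSet<>(Arrays.asList("the", "a", "of"));
        return keepByLength(removeStopWords(lowerCase(tokens), stopWords), 2, 10);
    }

    public static void main(String[] args) {
        // Prints [quick, brown, fox]
        System.out.println(analyze("The Quick brown fox of X"));
    }
}
```

Order matters in this sketch: lower-casing runs first so that "The" matches the lowercase stop word "the" before the stop word step sees it.
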
Modifier and Type | Class and Description |
---|---|
class | BrazilianStemFilter: Based on GermanStemFilter. |
Modifier and Type | Class and Description |
---|---|
class | ChineseFilter: A filter with a stop word table. Rule: no digits are allowed. |
Modifier and Type | Class and Description |
---|---|
class | GermanStemFilter: A filter that stems German words. |
Modifier and Type | Class and Description |
---|---|
class | GreekLowerCaseFilter: Normalizes token text to lower case, analyzing the given ("greek") charset. |
Modifier and Type | Class and Description |
---|---|
class | FrenchStemFilter: A filter that stems French words. |
Modifier and Type | Class and Description |
---|---|
class | DutchStemFilter: A filter that stems Dutch words. |
Modifier and Type | Class and Description |
---|---|
class | RussianLowerCaseFilter: Normalizes token text to lower case, analyzing the given ("russian") charset. |
class | RussianStemFilter: A filter that stems Russian words. |
Modifier and Type | Class and Description |
---|---|
class | SnowballFilter: A filter that stems words using a Snowball-generated stemmer. |
Modifier and Type | Class and Description |
---|---|
class | StandardFilter: Normalizes tokens extracted with StandardTokenizer. |
Modifier and Type | Class and Description |
---|---|
class | SynonymTokenFilter: Injects additional tokens for synonyms of token terms fetched from the underlying child stream; the child stream must deliver lowercase tokens for synonyms to be found. |
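
The lowercase requirement stated for SynonymTokenFilter can be illustrated with a small stand-alone sketch. It does not use the Lucene class itself: `SynonymInjectionSketch`, `injectSynonyms`, and the sample synonym map are hypothetical names and data, and the sketch ignores token positions and offsets, which a real token stream would also carry.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

/** Illustrative only: exact-match synonym lookup against lowercase keys. */
class SynonymInjectionSketch {

    /** Emits each incoming token, followed by any synonyms registered for that exact token. */
    static List<String> injectSynonyms(List<String> tokens, Map<String, List<String>> synonyms) {
        List<String> out = new ArrayList<>();
        for (String token : tokens) {
            out.add(token);
            List<String> extra = synonyms.get(token); // exact match, so case matters
            if (extra != null) {
                out.addAll(extra);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        // Assumed sample data: the map is keyed by lowercase terms.
        Map<String, List<String>> synonyms = new HashMap<>();
        synonyms.put("fast", Arrays.asList("quick", "rapid"));

        List<String> raw = Arrays.asList("Fast", "car");
        // Prints [Fast, car]: "Fast" does not match the lowercase key.
        System.out.println(injectSynonyms(raw, synonyms));

        List<String> lowered = new ArrayList<>();
        for (String token : raw) {
            lowered.add(token.toLowerCase(Locale.ROOT));
        }
        // Prints [fast, quick, rapid, car]
        System.out.println(injectSynonyms(lowered, synonyms));
    }
}
```

In this sketch, lower-casing the incoming tokens before the lookup is what makes the synonyms appear, which mirrors why a lowercasing step would sit earlier in the chain than the synonym step.
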
Copyright © 2000-2013 Apache Software Foundation. All Rights Reserved.