com.ibm.icu.text
Class BreakDictionary
java.lang.Object
com.ibm.icu.text.BreakDictionary
- public class BreakDictionary
- extends java.lang.Object
This is the class that represents the list of known words used by
DictionaryBasedBreakIterator. The conceptual data structure used
here is a trie: there is a node hanging off the root node for every
letter that can start a word. Each of these nodes has a node hanging
off of it for every letter that can be the second letter of a word
if this node is the first letter, and so on. The trie is represented
as a two-dimensional array that can be treated as a table of state
transitions. Indexes are used to compress this array, taking
advantage of the fact that this array will always be very sparse.
- Status:
- Internal. This API is ICU internal only.
Method Summary |
short |
at(int row,
char ch)
Uses the column map to map the character to a column number, then
passes the row and column number to the other version of at() |
short |
at(int row,
int col)
Returns the value in the cell with the specified (logical) row and
column numbers. |
static void |
main(java.lang.String[] args)
|
void |
printWordList(java.lang.String partialWord,
int state,
java.io.PrintWriter out)
|
void |
readDictionaryFile(java.io.DataInputStream in)
|
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
BreakDictionary
public BreakDictionary(java.io.InputStream dictionaryStream)
throws java.io.IOException
- Status:
- Internal. This API is ICU internal only.
main
public static void main(java.lang.String[] args)
throws java.io.FileNotFoundException,
java.io.UnsupportedEncodingException,
java.io.IOException
- Throws:
java.io.FileNotFoundException
java.io.UnsupportedEncodingException
java.io.IOException
- Status:
- Internal. This API is ICU internal only.
printWordList
public void printWordList(java.lang.String partialWord,
int state,
java.io.PrintWriter out)
throws java.io.IOException
- Throws:
java.io.IOException
- Status:
- Internal. This API is ICU internal only.
readDictionaryFile
public void readDictionaryFile(java.io.DataInputStream in)
throws java.io.IOException
- Throws:
java.io.IOException
- Status:
- Internal. This API is ICU internal only.
at
public final short at(int row,
char ch)
- Uses the column map to map the character to a column number, then
passes the row and column number to the other version of at()
- Parameters:
row
- The current statech
- The character whose column we're interested in
- Returns:
- The new state to transition to
- Status:
- Internal. This API is ICU internal only.
at
public final short at(int row,
int col)
- Returns the value in the cell with the specified (logical) row and
column numbers. In DictionaryBasedBreakIterator, the row number is
a state number, the column number is an input, and the return value
is the row number of the new state to transition to. (0 is the
"error" state, and -1 is the "end of word" state in a dictionary)
- Parameters:
row
- The row number of the current statecol
- The column number of the input character (0 means "not a
dictionary character")
- Returns:
- The row number of the new state to transition to
- Status:
- Internal. This API is ICU internal only.
Copyright (c) 2006 IBM Corporation and others.