public class NodeTokenizer extends Object
Splits a document into tokens using the following delimiters "<>". The tokens are represented by a starting and ending offset so a bunch of strings are not created until the content is needed.
Constructor and Description |
---|
NodeTokenizer(StringBuffer buffer)
Constructor with the complete document to parse.
|
Modifier and Type | Method and Description |
---|---|
protected void |
index(ArrayList tokenIndex)
This method is passed an empty
ArrayList that should be
populated into Token offsets. |
Iterator |
iterator()
Returns an implementation of the
Iterator
interface to enumerate the nodes within the document. |
public NodeTokenizer(StringBuffer buffer)
Constructor with the complete document to parse.
buffer
- documentCopyright © 2004-2013 Apache Software Foundation. All Rights Reserved.