org.htmlparser.nodes

Class TextNode

public class TextNode extends AbstractNode implements Text

Normal text in the HTML document is represented by this class.
Field Summary
protected String mText
The contents of the string node, or override text.
Constructor Summary
TextNode(String text)
Constructor takes in the text string.
TextNode(Page page, int start, int end)
Constructor takes in the page and the beginning and ending positions.
Method Summary
void accept(NodeVisitor visitor)
String visiting code.
String getText()
Returns the text of the node.
boolean isWhiteSpace()
Returns whether the node consists only of white space.
void setText(String text)
Sets the string contents of the node.
String toHtml(boolean verbatim)
Returns the text of the node.
String toPlainTextString()
Returns the text of the node.
String toString()
Expresses this string node as a printable string. This is suitable for display in a debugger or output to a printout.

Field Detail

mText

protected String mText
The contents of the string node, or override text.

Constructor Detail

TextNode

public TextNode(String text)
Constructor takes in the text string.

Parameters: text The string node text. For correct generation of HTML, this should not contain representations of tags (unless they are balanced).

TextNode

public TextNode(Page page, int start, int end)
Constructor takes in the page and the beginning and ending positions.

Parameters:
page The page this string is on.
start The beginning position of the string.
end The ending position of the string.
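The two constructors suggest two sources for a node's text: a span of the underlying page, or an explicit string (the "override text" held in mText). The sketch below illustrates that idea with a hypothetical stand-in class, TextSpanSketch, which uses a plain String in place of Page and assumes the end position is exclusive. It is not the library's implementation, and the real class may resolve positions differently.

```java
// Illustrative stand-in only; NOT the org.htmlparser.nodes.TextNode implementation.
class TextSpanSketch {
    private final String pageSource; // stands in for the Page's character data
    private final int start;         // beginning position (inclusive)
    private final int end;           // ending position (assumed exclusive)
    private String overrideText;     // plays the role of mText

    // Mirrors TextNode(String text): the node carries its own text.
    TextSpanSketch(String text) {
        this.pageSource = null;
        this.start = 0;
        this.end = 0;
        this.overrideText = text;
    }

    // Mirrors TextNode(Page page, int start, int end): the text is a span of the page.
    TextSpanSketch(String pageSource, int start, int end) {
        this.pageSource = pageSource;
        this.start = start;
        this.end = end;
        this.overrideText = null;
    }

    // Override text wins when present; otherwise read the span from the page.
    String getText() {
        if (overrideText != null) {
            return overrideText;
        }
        return pageSource.substring(start, end);
    }

    // Setting text replaces whatever the page span would have produced.
    void setText(String text) {
        overrideText = text;
    }

    public static void main(String[] args) {
        TextSpanSketch node = new TextSpanSketch("<p>Hello world</p>", 3, 14);
        System.out.println(node.getText()); // text taken from the page span
        node.setText("Goodbye");
        System.out.println(node.getText()); // override text
    }
}
```

Under these assumptions, a page-backed node costs nothing to create beyond two offsets, and a string is stored only when the text is supplied or changed explicitly.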

Method Detail

accept

public void accept(NodeVisitor visitor)
String visiting code.

Parameters: visitor The NodeVisitor object to invoke visitStringNode() on.
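The dispatch described above is a conventional visitor callback: the node passes itself to the visitor's string-node method. The following is a minimal sketch of that pattern using hypothetical stand-in types (SketchVisitor, SketchText, TextCollector) so that it compiles without the library. The real NodeVisitor and Text types live in org.htmlparser and have more members than shown here.

```java
// Hypothetical minimal types for illustration; not the org.htmlparser classes.
interface SketchVisitor {
    void visitStringNode(SketchText text);
}

class SketchText {
    private final String text;

    SketchText(String text) {
        this.text = text;
    }

    String getText() {
        return text;
    }

    // Double dispatch: the node hands itself to the visitor's text callback.
    void accept(SketchVisitor visitor) {
        visitor.visitStringNode(this);
    }
}

// A visitor that concatenates the text of every node it is shown.
class TextCollector implements SketchVisitor {
    final StringBuilder collected = new StringBuilder();

    @Override
    public void visitStringNode(SketchText text) {
        collected.append(text.getText());
    }
}

class VisitorSketchDemo {
    public static void main(String[] args) {
        TextCollector collector = new TextCollector();
        new SketchText("Hello, ").accept(collector);
        new SketchText("world").accept(collector);
        System.out.println(collector.collected);
    }
}
```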

getText

public String getText()
Returns the text of the node. This is the same as toHtml() for this type of node.

Returns: The contents of this text node.

isWhiteSpace

public boolean isWhiteSpace()
Returns whether the node consists only of white space. White space can be spaces, new lines, etc.
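As a rough illustration of a whitespace-only test, the sketch below uses a hypothetical helper, isOnlyWhiteSpace, and assumes that "white space" means any character accepted by Character.isWhitespace. The library's exact rule is not stated on this page and may differ, for example in how it treats non-breaking spaces or empty text.

```java
// Illustrative helper; not the library's isWhiteSpace() implementation.
class WhiteSpaceSketch {
    // Assumption: white space is whatever Character.isWhitespace accepts.
    static boolean isOnlyWhiteSpace(String text) {
        for (int i = 0; i < text.length(); i++) {
            if (!Character.isWhitespace(text.charAt(i))) {
                return false; // found a visible character
            }
        }
        return true; // also true for the empty string
    }

    public static void main(String[] args) {
        System.out.println(isOnlyWhiteSpace(" \t\r\n"));
        System.out.println(isOnlyWhiteSpace(" a "));
    }
}
```

A check like this is typically used to skip the indentation and line breaks between tags when walking parsed nodes.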

setText

public void setText(String text)
Sets the string contents of the node.

Parameters: text The new text for the node.

toHtml

public String toHtml(boolean verbatim)
Returns the text of the node.

Parameters: verbatim If true return as close to the original page text as possible.

Returns: The contents of this text node.

toPlainTextString

public String toPlainTextString()
Returns the text of the node. This is the same as toHtml() for this type of node.

Returns: The contents of this text node.

toString

public String toString()
Expresses this string node as a printable string. This is suitable for display in a debugger or output to a printout. Control characters are replaced by their equivalent escape sequences, and the contents are truncated to 80 characters.

Returns: A string representation of the string node.
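The escaping and truncation described for toString() can be sketched as follows. The helper name toPrintable and the constant MAX_LENGTH are hypothetical, only tab, newline, and carriage return are escaped, and the real method may add a prefix, position information, or an ellipsis that this sketch omits.

```java
// Illustrative helper; not the library's toString() implementation.
class PrintableSketch {
    static final int MAX_LENGTH = 80; // limit taken from the description above

    static String toPrintable(String text) {
        StringBuilder out = new StringBuilder();
        // Stop reading input once the output has reached the limit.
        for (int i = 0; i < text.length() && out.length() < MAX_LENGTH; i++) {
            char c = text.charAt(i);
            switch (c) {
                case '\t': out.append("\\t"); break; // tab becomes backslash-t
                case '\n': out.append("\\n"); break; // newline becomes backslash-n
                case '\r': out.append("\\r"); break; // carriage return becomes backslash-r
                default:   out.append(c);            // other characters pass through
            }
        }
        // A two-character escape can overshoot by one; trim back to the limit.
        // (This may cut an escape sequence in half, which a fuller version would avoid.)
        if (out.length() > MAX_LENGTH) {
            out.setLength(MAX_LENGTH);
        }
        return out.toString();
    }

    public static void main(String[] args) {
        System.out.println(toPrintable("line one\n\tline two"));
    }
}
```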

HTML Parser is an open source library released under LGPL.