public interface BoilerpipeExtractor extends BoilerpipeFilter
Describes a complete filter pipeline.
Method Summary | |
---|---|
java.lang.String | getText(org.xml.sax.InputSource is) Extracts text from the HTML code available from the given InputSource. |
java.lang.String | getText(java.io.Reader r) Extracts text from the HTML code available from the given Reader. |
java.lang.String | getText(java.lang.String html) Extracts text from the HTML code given as a String. |
java.lang.String | getText(TextDocument doc) Extracts text from the given TextDocument object. |
Methods inherited from interface de.l3s.boilerpipe.BoilerpipeFilter |
---|
process |
Method Detail |
---|
java.lang.String getText(java.lang.String html) throws BoilerpipeProcessingException
Extracts text from the HTML code given as a String.
Parameters:
html - The HTML code as a String.
Throws:
BoilerpipeProcessingException
java.lang.String getText(org.xml.sax.InputSource is) throws BoilerpipeProcessingException
Extracts text from the HTML code available from the given InputSource.
Parameters:
is - The InputSource containing the HTML.
Throws:
BoilerpipeProcessingException
java.lang.String getText(java.io.Reader r) throws BoilerpipeProcessingException
Extracts text from the HTML code available from the given Reader.
Parameters:
r - The Reader containing the HTML.
Throws:
BoilerpipeProcessingException
java.lang.String getText(TextDocument doc) throws BoilerpipeProcessingException
Extracts text from the given TextDocument object.
Parameters:
doc - The TextDocument.
Throws:
BoilerpipeProcessingException
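
The getText overloads above differ only in how the HTML is supplied. One plausible way to implement such a family is to funnel the String and Reader variants into the InputSource variant, as in the minimal sketch below. Everything in it is a hypothetical stand-in so that it compiles without the library: SketchExtractor, SketchProcessingException and TagStrippingExtractor are not boilerpipe classes, the exception is assumed to be checked, the TextDocument overload is left out, and the tag-stripping logic is a toy placeholder rather than boilerpipe's extraction algorithm.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import org.xml.sax.InputSource;

// Hypothetical stand-in types that mirror the shape of the documented
// overloads. None of these are boilerpipe classes.
class ExtractorSketch {

    /** Stand-in for BoilerpipeProcessingException (assumed to be checked). */
    static class SketchProcessingException extends Exception {
        SketchProcessingException(Throwable cause) {
            super(cause);
        }
    }

    /** Stand-in interface with three of the four documented overloads. */
    interface SketchExtractor {
        String getText(String html) throws SketchProcessingException;
        String getText(Reader r) throws SketchProcessingException;
        String getText(InputSource is) throws SketchProcessingException;
    }

    /** Toy implementation: replaces tags with spaces and collapses whitespace. */
    static class TagStrippingExtractor implements SketchExtractor {

        // String input is wrapped in a StringReader and delegated.
        public String getText(String html) throws SketchProcessingException {
            return getText(new StringReader(html));
        }

        // Reader input is wrapped in an InputSource and delegated.
        public String getText(Reader r) throws SketchProcessingException {
            return getText(new InputSource(r));
        }

        // All overloads end up here, so the real work lives in one place.
        public String getText(InputSource is) throws SketchProcessingException {
            Reader r = is.getCharacterStream();
            if (r == null) {
                // This sketch only handles character streams.
                throw new SketchProcessingException(
                        new IOException("InputSource has no character stream"));
            }
            StringBuilder raw = new StringBuilder();
            boolean inTag = false;
            try {
                for (int c = r.read(); c != -1; c = r.read()) {
                    if (c == '<') {
                        inTag = true;
                        raw.append(' '); // keep words on either side of a tag apart
                    } else if (c == '>') {
                        inTag = false;
                    } else if (!inTag) {
                        raw.append((char) c);
                    }
                }
            } catch (IOException e) {
                // I/O failures surface as the processing exception.
                throw new SketchProcessingException(e);
            }
            return raw.toString().replaceAll("\\s+", " ").trim();
        }
    }

    public static void main(String[] args) throws SketchProcessingException {
        SketchExtractor extractor = new TagStrippingExtractor();
        // prints: Hello world
        System.out.println(extractor.getText("<p>Hello <b>world</b></p>"));
    }
}
```

Keeping a single overload responsible for the actual work means the convenience variants cannot drift apart in behaviour. Whether boilerpipe's own extractors are structured this way is not stated on this page.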