Module nltk_lite.corpora.senseval
Read from the Senseval 2 Corpus.
SENSEVAL [http://www.senseval.org/] Evaluation exercises for Word
Sense Disambiguation. Organized by ACL-SIGLEX
[http://www.siglex.org/]
Prepared by Ted Pedersen <tpederse@umn.edu>, University of
Minnesota, http://www.d.umn.edu/~tpederse/data.html Distributed with
permission.
The NLTK version of the Senseval 2 files uses well-formed XML. Each
instance of the ambiguous words "hard", "interest",
"line", and "serve" is tagged with a sense
identifier, and supplied with context.
Function Summary |
|
demo()
|
iterator over tuple
|
raw (files)
|
|
_to_ascii(text)
|
Variable Summary |
list |
items = ['hard', 'interest', 'line', 'serve']
|
raw(files=['hard', 'interest', 'line', 'serve'])
-
- Parameters:
files -
One or more Senseval files to be processed
(type=string or tuple(string) )
- Returns:
-
iterator over
tuple
|
items
-
- Type:
-
list
- Value:
['hard', 'interest', 'line', 'serve']
|
|