Table of Contents
index.gloss, gloss.idx - index of words found in WordNet synset glosses
The WordNet gloss index lists all of the content words found
in the synset glosses and identifies the gloss(es) containing the word.
The file stoplist.pl contains a list of function words that are omitted
from the gloss index.
This file can be used to help find WordNet synsets
that are related topically. For example, the gloss index can be used to
find synsets in all syntactic categories related to the monosemous noun
and verb golf by retrieving all synsets listed in the gloss index after
the strings golf, golfclub, golfer and golfers.
It is important
to note, as indicated in the previous example, that in this file words
are simply strings of consecutive characters found in the glosses, and
are not syntactically tagged or lemmatized. Base forms and various inflections
must be searched for separately. Strings are all in lower case, however,
so gloss terms encountered in synsets in both upper and lower case are
folded into the same word entry.
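Because base forms and inflections are separate entries, a lookup has to try each string explicitly and merge the results. The following Python sketch illustrates the idea on a small in-memory table. The function name related_synsets, the dictionary layout, and the offsets are all hypothetical placeholders chosen for illustration; they are not real WordNet data.

```python
def related_synsets(index, forms):
    """Collect the (pos, synset_offset) pairs listed under any of the given strings."""
    found = set()
    for form in forms:
        # Index entries are lower case, so fold the query string the same way.
        found.update(index.get(form.lower(), ()))
    return sorted(found)


# Toy index: word -> list of (pos, synset_offset). Offsets are made up.
toy_index = {
    "golf": [(1, 100), (2, 200)],
    "golfer": [(1, 300)],
    "golfers": [(1, 100), (3, 400)],
}

# "golfclub" is absent from the toy index and simply contributes nothing.
print(related_synsets(toy_index, ["Golf", "golfclub", "golfer", "golfers"]))
```

Using a set removes duplicates when the same synset is listed under more than one string.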
The gloss index file is
in alphabetical order, fields are separated by one space, and each line
is terminated with a newline character. Items in square brackets
are optional.
Each line is of the form:
word pos,synset_offset [pos,synset_offset...]
where word is a lower case string of characters as found in a synset
gloss and pos is an integer indicating the syntactic category of the
synset as follows:
1 NOUN
2 VERB
3 ADJECTIVE
4 ADVERB
See wndb(5WN) for a description of synset_offset.
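A line in this format can be split on single spaces and each remaining field on its comma. The Python sketch below is one possible reading of the format described above, not code shipped with WordNet. The function name parse_gloss_index_line is hypothetical, the sample line uses made-up offsets, and synset_offset is kept as a string because its interpretation is defined in wndb(5WN).

```python
# Syntactic category codes as listed in the format description.
POS_NAMES = {1: "NOUN", 2: "VERB", 3: "ADJECTIVE", 4: "ADVERB"}


def parse_gloss_index_line(line):
    """Return (word, [(pos_name, synset_offset), ...]) for one index line."""
    fields = line.rstrip("\n").split(" ")
    word = fields[0]
    refs = []
    for field in fields[1:]:
        if not field:
            continue  # tolerate a stray trailing space
        pos, offset = field.split(",")
        refs.append((POS_NAMES[int(pos)], offset))
    return word, refs


# Made-up sample line; the offsets are placeholders, not real WordNet data.
sample = "golf 1,00000100 2,00000200\n"
print(parse_gloss_index_line(sample))
```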
The gloss index is a very large
file (5.7MB), and is not used by the WordNet searching software. It can
be useful to applications that the user may wish to write, and is therefore
included in the WordNet package. If you are not doing research or development
that uses this file, it can be deleted from the WNSEARCHDIR directory
in order to save disk space.
- WNHOME
- Base directory for WordNet. Unix default is /usr/local/wordnet1.6, PC default is C:\wn16, Macintosh default is : .
- WNSEARCHDIR
- Directory in which the WordNet database has been installed. Unix and PC default is WNHOME/dict. Macintosh default is :Database .
All files are in directory WNSEARCHDIR:
- index.gloss
- gloss index (Unix and Macintosh)
- gloss.idx
- gloss index (PC)
- stoplist.pl
- Perl associative array listing function words to ignore when parsing glosses
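An application that reads the gloss index first has to locate it. The Python sketch below covers only the Unix case and assumes that WNHOME and WNSEARCHDIR, when set in the environment, take precedence over the defaults listed above. The function name gloss_index_path is hypothetical, and the PC and Macintosh names and defaults are not handled.

```python
import os
import posixpath


def gloss_index_path(env=None):
    """Build the Unix path to index.gloss from WNHOME / WNSEARCHDIR."""
    env = os.environ if env is None else env
    # Defaults taken from the descriptions above (Unix only).
    wnhome = env.get("WNHOME", "/usr/local/wordnet1.6")
    searchdir = env.get("WNSEARCHDIR", posixpath.join(wnhome, "dict"))
    return posixpath.join(searchdir, "index.gloss")


# With neither variable set, the documented Unix defaults apply.
print(gloss_index_path({}))
```

Passing the environment as a plain dictionary keeps the function easy to test without changing the real process environment.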
See also senseidx(5WN), wndb(5WN).