Train-O-Matic: Large-Scale Supervised Word Sense Disambiguation in Multiple Languages without Manual Training Data

Word Sense Disambiguation Training set
DOI: 10.18653/v1/d17-1008 Publication Date: 2018-01-18T11:54:31Z
ABSTRACT
Annotating large numbers of sentences with senses is the heaviest requirement current Word Sense Disambiguation. We present Train-O-Matic, a language-independent method for generating millions sense-annotated training instances virtually all meanings words in language's vocabulary. The approach fully automatic: no human intervention required and only type knowledge used WordNet-like resource. Train-O-Matic achieves consistently state-of-the-art performance across gold standard datasets languages, while at same time removing burden manual annotation. All data available research purposes http://trainomatic.org.
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES (0)
CITATIONS (13)