Huge Automatically Extracted Training Sets for Multilingual Word Sense Disambiguation.
Tommaso Pasini, Francesco Maria Elia, Roberto Navigli
Published in: CoRR (2018)
Keyphrases
- automatically extracted
- word sense disambiguation
- training set
- lexical knowledge
- multilingual information retrieval
- WordNet
- wide coverage
- machine translation
- natural language processing
- classification accuracy
- cross-lingual
- linguistic knowledge
- semantic similarity
- language independent
- semantic relatedness
- supervised learning
- word sense
- information extraction
- part of speech
- cross-language information retrieval
- unsupervised word sense disambiguation
- active learning
- feature set
- domain knowledge
- lexical information
- decision trees
- feature selection
- semantic relations
- semantic information
- co-occurrence
- training data
- sense disambiguation
- data sets