Huge Automatically Extracted Training-Sets for Multilingual Word Sense Disambiguation.
Tommaso Pasini, Francesco Elia, Roberto Navigli
Published in: LREC (2018)
Keyphrases
- automatically extracted
- training set
- language specific
- n-gram
- parallel corpus
- language independent
- digital libraries
- manually created
- co-occurrence
- supervised learning
- classification accuracy
- word segmentation
- active learning
- training samples
- visually similar
- cross-lingual
- information retrieval
- cross-language information retrieval
- training examples
- text classification
- training data
- data sets
- SVM classifier
- feature space
- bilingual dictionaries
- decision trees
- Indian languages
- language resources
- cross-language IR