Word Representations, Seed Lexicons, Mapping Procedures, and Reference Lists: What Matters in Bilingual Lexicon Induction from Comparable Corpora?
Martin LavilleMérième BouhandiEmmanuel MorinPhilippe LanglaisPublished in: Canadian Conference on AI (2020)
Keyphrases
- bilingual lexicon
- comparable corpora
- cross language information retrieval
- word pairs
- parallel corpora
- machine translation
- bilingual dictionaries
- news articles
- cross language
- semi automatically
- language modeling
- text corpora
- translation model
- cross lingual
- query translation
- natural language processing
- term extraction
- language independent
- machine learning
- statistical machine translation
- text documents
- retrieval systems
- similarity measure
- information extraction
- feature selection