Building Large Monolingual Dictionaries at the Leipzig Corpora Collection: From 100 to 200 Languages.
Dirk GoldhahnThomas EckartUwe QuasthoffPublished in: LREC (2012)
Keyphrases
- bilingual dictionaries
- cross lingual
- statistical machine translation
- parallel corpora
- linguistic resources
- machine translation
- comparable corpora
- chinese english
- machine readable dictionaries
- cross language information retrieval
- parallel corpus
- query translation
- multilingual information retrieval
- language independent
- language resources
- target language
- european languages
- natural language processing
- parallel texts
- text collections
- document collections
- cross lingual information retrieval
- cross language
- machine translation system
- information retrieval
- question answering
- expressive power
- translation model
- cross language retrieval
- query expansion
- ad hoc retrieval
- source language
- language model
- language modeling
- retrieval systems
- web retrieval
- lexical knowledge