Emerging Language Spaces Learned From Massively Multilingual Corpora.
Jörg Tiedemann
Published in: DHN (2018)
Keyphrases
- parallel corpus
- comparable corpora
- cross language information retrieval
- language specific
- linguistic resources
- language resources
- cross lingual
- Chinese English
- programming language
- machine translation system
- language learning
- natural language
- natural language processing
- digital libraries
- machine learning
- parallel corpora
- language independent
- machine translation
- lexical knowledge
- data mining
- query translation
- cross language
- hand crafted
- bilingual dictionaries
- text generation
- bilingual lexicon
- word alignment
- statistical machine translation
- translation model
- computational linguistics
- massively parallel
- news articles
- n gram
- question answering
- query expansion
- language model