Large Scale Multilingual Broadcast Data Collection to Support Machine Translation and Distillation Technology Development.
Kevin WalkerChristopher CarusoDenise DiPersioPublished in: LREC (2010)
Keyphrases
- machine translation
- cross lingual
- data collection
- language independent
- cross language information retrieval
- language resources
- chinese english
- machine translation system
- multilingual documents
- natural language processing
- information extraction
- statistical machine translation
- language processing
- cross language
- language specific
- natural language generation
- target language
- query translation
- word sense disambiguation
- comparable corpora
- parallel corpus
- cross lingual information retrieval
- word alignment
- parallel corpora
- digital libraries
- bilingual lexicon
- multilingual information retrieval
- data mining
- tasks in natural language processing
- word level
- text mining
- natural language
- information retrieval