The MILE Corpus for Less Commonly Taught Languages.
Alison AlvarezLori S. LevinRobert E. FrederkingSimon FungDonna GatesJeff GoodPublished in: HLT-NAACL (2006)
Keyphrases
- statistical machine translation
- expressive power
- language independent
- sentence pairs
- cross lingual
- multi lingual
- linguistic resources
- comparable corpora
- test set
- machine learning
- parallel corpora
- spoken language
- document level
- supervised machine learning
- machine translation system
- manually annotated
- query expansion
- co occurrence
- knowledge representation