Constructing Parallel Corpora for Six Indian Languages via Crowdsourcing.
Matt PostChris Callison-BurchMiles OsbornePublished in: WMT@NAACL-HLT (2012)
Keyphrases
- parallel corpora
- cross lingual
- indian languages
- cross lingual information retrieval
- machine translation
- language independent
- comparable corpora
- cross language information retrieval
- cross language
- machine translation system
- language identification
- language modeling
- labor intensive
- translation model
- document images
- sentence level
- word pairs
- text classification
- query translation
- sentiment classification
- word segmentation
- wikipedia articles
- artificial intelligence