Benchmarking of English-Hindi parallel corpora.
Jayendra Rakesh YekaPrasanth KolachinaDipti Misra SharmaPublished in: LREC (2014)
Keyphrases
- parallel corpora
- comparable corpora
- machine translation
- statistical machine translation
- cross lingual
- cross language information retrieval
- english chinese
- query translation
- language independent
- machine translation system
- cross language
- target language
- cross lingual information retrieval
- word pairs
- sentence pairs
- bilingual dictionaries
- training corpus
- news articles
- natural language processing
- labor intensive
- indian languages
- translation model
- language model
- source language
- information retrieval
- language modeling
- natural language
- sentence level
- wikipedia articles
- named entity recognition
- information extraction