The IIT Bombay English-Hindi Parallel Corpus.
Anoop KunchukuttanPratik MehtaPushpak BhattacharyyaPublished in: LREC (2018)
Keyphrases
- parallel corpus
- machine translation
- cross lingual
- statistical machine translation
- source language
- target language
- query translation
- cross language information retrieval
- word alignment
- indian languages
- machine translation system
- language independent
- comparable corpora
- sentence pairs
- cross language
- parallel corpora
- named entity recognition
- document clustering
- text classification
- n gram
- clustering method
- query expansion
- natural language processing
- information extraction
- natural language
- search engine