Corpus-Based Translation Induction in Indian Languages Using Auxiliary Language Corpora from Wikipedia.
Goutham TholpadiChiranjib BhattacharyyaShirish K. ShevadePublished in: ACM Trans. Asian Low Resour. Lang. Inf. Process. (2017)
Keyphrases
- indian languages
- cross lingual
- parallel corpus
- cross lingual information retrieval
- comparable corpora
- machine translation
- parallel corpora
- chinese english
- linguistic resources
- document images
- language identification
- statistical machine translation
- target language
- computational linguistics
- machine translation system
- query translation
- natural language processing
- wikipedia articles
- bilingual dictionaries
- wordnet
- source language
- language independent
- cross language information retrieval
- multi lingual
- cross language
- language modeling
- natural language
- translation model
- document collections
- document clustering
- spoken language
- english text
- named entities
- document analysis
- keywords
- active learning
- language processing