Creation of comparable corpora for English-Urdu, Arabic, Persian.
Murad AbouammohKashif ShahAhmet AkerPublished in: LREC (2016)
Keyphrases
- comparable corpora
- language identification
- cross language information retrieval
- parallel corpora
- cross lingual
- machine translation
- language modeling
- news articles
- bilingual lexicon
- text corpora
- text classification
- sentiment analysis
- cross language
- language model
- query translation
- statistical machine translation
- text retrieval
- text documents
- word pairs
- word sense disambiguation
- bi directional
- translation model
- bilingual dictionaries
- n gram
- knowledge base
- parallel corpus
- linguistic resources
- text mining
- natural language processing