WebCrawl African: A Multilingual Parallel Corpora for African Languages.
Pavanpankaj Vegi, Sivabhavani J., Biswajit Paul, Abhinav Mishra, Prashant Banjare, K. R. Prasanna Kumar, Chitra Viswanathan
Published in: WMT (2022)
Keyphrases
- parallel corpora
- cross lingual
- comparable corpora
- language independent
- cross language information retrieval
- cross lingual information retrieval
- language resources
- machine translation
- bilingual dictionaries
- machine translation system
- cross language
- query translation
- statistical machine translation
- linguistic resources
- language modeling
- word pairs
- Chinese English
- sentence pairs
- labor intensive
- sentence level
- text classification
- translation model
- sentiment classification
- parallel corpus
- document retrieval
- digital libraries
- news articles
- question answering
- document collections
- information retrieval
- out of vocabulary