Harnessing Cross-lingual Features to Improve Cognate Detection for Low-resource Languages.
Diptesh KanojiaRaj DabreShubham DewanganPushpak BhattacharyyaGholamreza HaffariMalhar KulkarniPublished in: CoRR (2021)
Keyphrases
- cross lingual
- language independent
- cross lingual information retrieval
- language modeling
- machine translation
- multi lingual
- european languages
- cross language
- event extraction
- feature vectors
- text classification
- parallel corpus
- query translation
- news articles
- statistical machine translation
- co occurrence
- clustering algorithm
- mono lingual
- translation model
- document clustering
- n gram
- text categorization
- prior knowledge
- feature space
- reinforcement learning