Predicting Machine Translation Performance on Low-Resource Languages: The Role of Domain Similarity.
Eric KhiuHasti ToossiJinyu LiuJiaxu LiDavid AnugrahaJuan Armando Parra FloresLeandro Acros RomanA. Seza DogruözEn-Shiun Annie LeePublished in: EACL (Findings) (2024)
Keyphrases
- machine translation
- target language
- language independent
- cross lingual
- statistical machine translation
- multilingual documents
- machine translation system
- language resources
- language specific
- query translation
- parallel corpora
- source language
- information extraction
- language processing
- similarity measure
- cross language information retrieval
- natural language processing
- chinese english
- bilingual dictionaries
- cross lingual information retrieval
- grammar induction
- word alignment
- natural language generation
- natural language
- comparable corpora
- word level
- machine readable dictionaries
- multilingual information retrieval
- parallel corpus
- word order
- word sense disambiguation
- data mining
- brazilian portuguese
- cross domain
- tasks in natural language processing
- knowledge representation
- broadcast news
- cross language
- word pairs