Machine Translation Customization via Automatic Training Data Selection from the Web.
Thuy VuAlessandro MoschittiPublished in: CoRR (2021)
Keyphrases
- machine translation
- training data
- multilingual documents
- language independent
- information extraction
- cross lingual
- natural language processing
- language processing
- target language
- natural language
- word sense disambiguation
- cross language information retrieval
- natural language generation
- statistical machine translation
- machine translation system
- web documents
- chinese english
- parallel corpora
- learning algorithm
- word alignment
- web pages
- language resources
- word level
- training set
- information retrieval
- machine transliteration
- brazilian portuguese