Selecting Backtranslated Data from Multiple Sources for Improved Neural Machine Translation.
Xabier SotoDimitar Sht. ShterionovAlberto PoncelasAndy WayPublished in: CoRR (2020)
Keyphrases
- machine translation
- data from multiple sources
- natural language processing
- language independent
- cross lingual
- information extraction
- cross language information retrieval
- chinese english
- multiple sources
- language resources
- statistical machine translation
- word alignment
- brazilian portuguese
- data integration
- target language
- query translation
- multiple data sources
- spatial data
- data cleaning
- machine translation system
- parallel corpus
- natural language
- data processing
- source language
- word level
- relational databases
- feature selection
- databases
- data sets