OpenSubtitles2018: Statistical Rescoring of Sentence Alignments in Large, Noisy Parallel Corpora.
Pierre Lison, Jörg Tiedemann, Milen Kouylekov
Published in: LREC (2018)
Keyphrases
- sentence pairs
- parallel corpora
- parallel corpus
- word pairs
- machine translation
- machine translation system
- word level
- sentence level
- cross lingual
- cross language information retrieval
- language independent
- labor intensive
- statistical machine translation
- wikipedia articles
- statistical models
- machine learning
- target language
- query translation
- social media