Dual Conditional Cross-Entropy Filtering of Noisy Parallel Corpora.
Marcin Junczys-DowmuntPublished in: CoRR (2018)
Keyphrases
- cross entropy
- parallel corpora
- cross lingual
- cross language information retrieval
- machine translation
- language modeling
- maximum likelihood
- labor intensive
- log likelihood
- language independent
- word pairs
- evaluation metrics
- machine translation system
- cross language
- error function
- statistical machine translation
- wikipedia articles
- query translation
- machine learning
- sentence level
- ranking functions
- information theoretic
- scoring function
- neural network
- information extraction
- language model
- question answering