Noisy Parallel Corpus Filtering through Projected Word Embeddings.
Murathan KurfaliRobert ÖstlingPublished in: WMT (3) (2019)
Keyphrases
- parallel corpus
- cross lingual
- cross language information retrieval
- word alignment
- machine translation
- machine translation system
- statistical machine translation
- language independent
- query translation
- sentence pairs
- vector space
- target language
- language modeling
- information filtering
- context sensitive
- low dimensional
- source language
- semantic space
- document clustering
- text categorization
- co occurrence
- parallel corpora
- bilingual dictionaries
- digital libraries
- bayesian networks
- information retrieval