Filtering of Noisy Parallel Corpora Based on Hypothesis Generation.
Zuzanna ParchetaGermán Sanchis-TrillesFrancisco CasacubertaPublished in: WMT (3) (2019)
Keyphrases
- parallel corpora
- machine translation
- cross language information retrieval
- cross lingual
- machine translation system
- labor intensive
- language independent
- comparable corpora
- query translation
- wikipedia articles
- word pairs
- information retrieval
- sentence level
- fully automated
- sentiment analysis
- wordnet
- natural language processing
- statistical machine translation
- knowledge representation
- feature selection