NICT's Corpus Filtering Systems for the WMT18 Parallel Corpus Filtering Task.
Rui Wang, Benjamin Marie, Masao Utiyama, Eiichiro Sumita
Published in: WMT (shared task) (2018)
Keyphrases
- parallel corpus
- cross lingual
- language independent
- cross language information retrieval
- machine translation
- sentence pairs
- machine translation system
- word alignment
- query translation
- statistical machine translation
- information filtering
- parallel texts
- cross language
- document clustering
- n gram
- transfer learning
- latent semantic analysis
- target language
- wordnet
- digital libraries