Unsupervised Multilingual Sentence Embeddings for Parallel Corpus Mining.
Ivana KvapilíkováMikel ArtetxeGorka LabakaEneko AgirreOndrej BojarPublished in: ACL (student) (2020)
Keyphrases
- parallel corpus
- cross lingual
- cross language information retrieval
- language independent
- sentence pairs
- machine translation
- word alignment
- machine translation system
- query translation
- statistical machine translation
- cross lingual information retrieval
- target language
- knowledge discovery
- semi supervised
- data mining
- cross language
- text mining
- vector space
- low dimensional
- source language
- parallel corpora
- bayesian networks
- document retrieval
- translation model
- text classification
- supervised learning
- clustering algorithm