Hierarchical Document Encoder for Parallel Corpus Mining.
Mandy Guo, Yinfei Yang, Keith Stevens, Daniel Cer, Heming Ge, Yun-Hsuan Sung, Brian Strope, Ray Kurzweil
Published in: CoRR (2019)
Keyphrases
- parallel corpus
- cross lingual
- source language
- cross language information retrieval
- information retrieval
- semantic space
- latent semantic analysis
- document clustering
- data mining
- knowledge discovery
- web documents
- information retrieval systems
- keywords
- machine translation system
- text mining
- machine translation
- vector space model
- user queries
- statistical machine translation
- document retrieval
- text documents
- language independent
- query translation
- document collections
- retrieval systems