Cross-language information retrieval models based on latent topic models trained with document-aligned comparable corpora.
Ivan VulicWim De SmetMarie-Francine MoensPublished in: Inf. Retr. (2013)
Keyphrases
- cross language information retrieval
- comparable corpora
- translation model
- terminology extraction
- parallel corpora
- query translation
- machine translation
- query terms
- cross language
- bilingual lexicon
- news articles
- text documents
- linguistic resources
- language model
- document retrieval
- parallel corpus
- information extraction
- statistical models
- document collections
- cross lingual
- statistical machine translation
- language modeling
- bilingual dictionaries
- information retrieval systems
- natural language processing
- information retrieval