Extracting Multilingual Topics from Unaligned Comparable Corpora.
Jagadeesh JagarlamudiHal Daumé IIIPublished in: ECIR (2010)
Keyphrases
- comparable corpora
- text corpora
- cross language information retrieval
- news articles
- text documents
- parallel corpora
- bilingual lexicon
- machine translation
- language modeling
- cross lingual
- topic models
- word pairs
- cross language
- information retrieval
- cross lingual information retrieval
- wikipedia articles
- query translation
- text mining
- latent dirichlet allocation
- topic modeling
- bi directional
- search engine
- bilingual dictionaries
- text collections
- language model
- text analysis
- information extraction
- knowledge discovery
- keywords
- text categorization