Bilingual topic taxonomy generation based on bilingual documents clustering.
Cheng-Zhi ZhangPublished in: ICMLC (2011)
Keyphrases
- parallel corpora
- cross lingual
- document clustering
- multiword
- parallel corpus
- machine translation
- bilingual lexicon
- comparable corpora
- chinese english
- topic discovery
- word alignment
- document collections
- clustering algorithm
- query translation
- topic segmentation
- cross language
- cross language information retrieval
- topic modeling
- text documents
- document content
- topic detection
- expert finding
- text clustering
- clustering method
- document set
- information retrieval
- news articles
- word pairs
- k means
- information retrieval systems
- topic models
- relevant documents
- topic hierarchy
- xml documents
- indian languages
- text corpora
- bilingual dictionaries
- document classification
- source language
- topic specific
- web documents
- query topic
- news stories
- metadata