Topic tracking based on bilingual comparable corpora and semisupervised clustering.
Fumiyo FukumotoYoshimi SuzukiPublished in: ACM Trans. Asian Lang. Inf. Process. (2007)
Keyphrases
- comparable corpora
- news articles
- cross language information retrieval
- bilingual lexicon
- parallel corpora
- machine translation
- language modeling
- semi supervised
- cross lingual
- clustering algorithm
- k means
- word pairs
- text corpora
- clustering method
- bilingual dictionaries
- document clustering
- information retrieval
- knowledge discovery
- language independent
- labor intensive
- information filtering
- cross language
- statistical machine translation
- probabilistic model
- parallel corpus
- text classification
- text documents