A Sense Based Similarity Measure for Cross-Lingual Documents.
Hsun-Hui HuangHorng-Chang YangYau-Hwang KuoPublished in: ISDA (1) (2008)
Keyphrases
- cross lingual
- word sense
- similarity measure
- document clustering
- parallel corpus
- parallel corpora
- machine translation
- indian languages
- language independent
- language modeling
- document collections
- cross lingual information retrieval
- cross language
- information retrieval
- text classification
- clustering method
- linguistic resources
- information retrieval systems
- source language
- relevant documents
- machine translation system
- distance measure
- language model
- translation model
- text documents
- query terms
- news articles
- transfer learning
- query translation
- keywords
- pairwise
- retrieval systems
- tf idf
- vector space model
- web documents
- semi supervised
- text categorization
- clustering algorithm
- probabilistic topic models
- machine learning