Exploiting Comparable Corpora and Bilingual Dictionaries for Cross-Language Text Categorization.
Alfio Massimiliano GliozzoCarlo StrapparavaPublished in: ACL (2006)
Keyphrases
- cross language
- text categorization
- comparable corpora
- cross language information retrieval
- parallel corpora
- bilingual dictionaries
- query translation
- text classification
- knn
- language independent
- feature selection
- text documents
- k nearest neighbor
- semi supervised learning
- translation model
- cross lingual
- text collections
- text classifiers
- unlabeled data
- semi supervised
- statistical machine translation
- nearest neighbor
- data mining
- digital libraries
- text clustering
- labeled data
- tf idf
- transfer learning