Cross Language Text Categorization by Acquiring Multilingual Domain Models from Comparable Corpora.
Alfio GliozzoCarlo StrapparavaPublished in: ParallelText@ACL (2005)
Keyphrases
- text categorization
- cross language
- comparable corpora
- cross language information retrieval
- bilingual lexicon
- parallel corpora
- language independent
- query translation
- text classification
- text documents
- knn
- document classification
- k nearest neighbor
- semi supervised learning
- feature selection
- text classifiers
- domain knowledge
- unlabeled data
- tf idf
- transfer learning
- translation model
- bilingual dictionaries
- text retrieval
- news articles
- text collections
- question answering
- text mining