Language and Task Independent Text Categorization with Simple Language Models.
Fuchun Peng, Dale Schuurmans, Shaojun Wang
Published in: HLT-NAACL (2003)
Keyphrases
- text categorization
- language model
- language modeling
- text classification
- feature selection
- n gram
- document retrieval
- knn
- text documents
- multi label
- naive bayes
- information retrieval
- retrieval model
- k nearest neighbor
- probabilistic model
- query expansion
- term weighting
- term frequency
- test collection
- text classifiers
- semi supervised learning
- pseudo relevance feedback
- cross language
- query terms
- tf idf
- document representation
- ir models
- unlabeled data
- vector space model
- neural network
- machine translation
- nearest neighbor