A novel refinement approach for text categorization.
Songbo TanXueqi ChengMoustafa GhanemBin WangHongbo XuPublished in: CIKM (2005)
Keyphrases
- text categorization
- text classification
- feature selection
- knn
- k nearest neighbor
- multi label
- reuters corpus
- automated text categorization
- naive bayes
- information gain
- term frequency
- semi supervised learning
- document categorization
- feature weighting
- unlabeled data
- document frequency
- text documents
- text classifiers
- mutual information
- text collections
- document classification
- data sets
- feature space
- term selection
- unsupervised learning
- language model
- semantic browsing