Effect of term distributions on centroid-based text categorization.
Verayuth Lertnattee, Thanaruk Theeramunkong
Published in: Inf. Sci. (2004)
Keyphrases
- text categorization
- term frequency
- term weighting
- document frequency
- term selection
- text classification
- feature selection
- knn
- reuters corpus
- multi label
- k nearest neighbor
- information gain
- automated text categorization
- semi supervised learning
- text documents
- tf idf
- automatic text categorization
- nearest neighbor
- word frequency
- document categorization
- feature selection for text categorization
- feature weighting
- text classifiers
- unlabeled data
- data mining
- query terms
- naive bayes
- natural language processing
- transductive support vector machine
- neural network