An improved centroid classifier for text categorization.
Songbo Tan
Published in: Expert Syst. Appl. (2008)
Keyphrases
- text categorization
- text classifiers
- feature selection
- training documents
- feature weighting
- feature reduction
- feature selection and classifier
- text classification
- multi label classification
- knn
- multi label
- text documents
- k nearest neighbor
- naive bayes
- classify documents
- document classification
- information gain
- reuters corpus
- feature set
- training data
- automated text categorization
- linear svm
- accurate classifiers
- classification algorithm
- automatic text categorization
- decision trees
- feature selections
- tf idf
- term frequency
- semantic browsing
- feature selection for text categorization
- feature space
- training set
- svm classifier
- support vector machine
- unlabeled data
- semi supervised learning
- support vector
- vector space
- feature extraction
- machine learning
- class labels
- multi instance multi label learning
- nearest neighbor
- mutual information