A class-feature-centroid classifier for text categorization.
Hu Guan, Jingyu Zhou, Minyi Guo
Published in: WWW (2009)
Keyphrases
- text categorization
- feature selection
- text classifiers
- feature weighting
- feature set
- training documents
- multi-label
- linear SVM
- feature reduction
- feature selection and classifier
- text classification
- multi-label classification
- class labels
- kNN
- document classification
- distributional clustering
- information gain
- automatic text categorization
- feature subset
- feature space
- text documents
- classify documents
- Reuters corpus
- feature selection for text categorization
- k nearest neighbor
- training data
- term frequency
- multiple features
- feature selections
- decision trees
- feature vectors
- support vector
- neural network
- classification algorithm
- semi-supervised learning
- mutual information
- automated text categorization
- information extraction
- machine learning
- training set
- classification accuracy
- information theoretic
- class probabilities
- nearest neighbor
- comparative evaluation
- bag of words