Improving linear classifier for Chinese text categorization.
Jyh-Jong TsayJing-Doo WangPublished in: Inf. Process. Manag. (2004)
Keyphrases
- data mining
- text categorization
- linear classifiers
- feature selection
- multi class
- multi label
- document classification
- knn
- text classification
- k nearest neighbor
- hyperplane
- information gain
- generalization error
- data analysis
- semi supervised learning
- naive bayes
- text documents
- linear svm
- principal components
- principal component analysis
- training data
- svm classifier
- unsupervised learning
- active learning
- pairwise
- information retrieval