Experimental Study on Representing Units in Chinese Text Categorization.
Baoli Li, Yuzhong Chen, Xiaojing Bai, Shiwen Yu
Published in: CICLing (2003)
Keyphrases
- text categorization
- experimental study
- text classification
- feature selection
- reuters corpus
- multi label
- information gain
- knn
- k nearest neighbor
- naive bayes
- automated text categorization
- automatic text categorization
- text documents
- feature weighting
- tf idf
- semi supervised learning
- text classifiers
- feature selection for text categorization
- text collections
- term frequency
- experimental evaluation
- term weighting
- document frequency
- feature space
- feature selection and classifier
- data sets