Improving automatic Chinese text categorization by error correction.
Jyh-Jong TsayJing-Doo WangPublished in: IRAL (2000)
Keyphrases
- text categorization
- error correction
- text classification
- knn
- k nearest neighbor
- information gain
- text documents
- feature weighting
- feature selection
- error detection
- multi label
- term weighting
- text classifiers
- naive bayes
- error correcting
- semi supervised learning
- error detection and correction
- feature selections
- semantic browsing
- term frequency
- automatic text categorization
- reuters corpus
- automated text categorization
- unlabeled data
- document frequency
- high dimensional
- machine learning
- data sets