A Weighted Cluster-based Chinese Text Categorization Approach: Incorporating with Word Clusters.
Yu-Chieh WuJie-Chi YangPublished in: IIAI-AAI (2012)
Keyphrases
- text categorization
- term weighting
- term frequency
- document frequency
- distributional clustering
- text classification
- knn
- clustering algorithm
- reuters corpus
- word frequency
- multi label
- feature selection
- text documents
- information gain
- k nearest neighbor
- naive bayes
- n gram
- semi supervised learning
- text collections
- co occurrence
- automatic text categorization
- word sense disambiguation
- text summarization
- support vector machine
- data points
- keywords
- data mining
- feature selection for text categorization