Distributional Character Clustering for Chinese Text Categorization.
Xuezhong Zhou, Zhaohui Wu
Published in: PRICAI (2004)
Keyphrases
- text categorization
- text clustering
- knn
- pattern recognition and machine learning
- text classification
- feature selection
- distributional clustering
- information gain
- k means
- clustering method
- clustering algorithm
- automatic text categorization
- text classifiers
- multi label
- text documents
- k nearest neighbor
- automated text categorization
- feature reduction
- document categorization
- feature weighting
- naive bayes
- reuters corpus
- text collections
- unsupervised learning
- semi supervised learning
- co occurrence
- document clustering
- tf idf
- feature selections
- feature selection and classifier
- term frequency
- information theoretic
- mutual information
- support vector
- data points
- keywords
- reinforcement learning
- knowledge discovery
- vector space
- decision trees
- multi instance multi label learning