Chinese Text Categorization via Bottom-Up Weighted Word Clustering.
Yu-Chieh Wu
Published in: Int. J. Enterp. Inf. Syst. (2015)
Keyphrases
- text categorization
- distributional clustering
- text classification
- information theoretic
- term frequency
- term weighting
- text clustering
- document frequency
- knn
- pattern recognition and machine learning
- feature selection
- naive bayes
- clustering algorithm
- automated text categorization
- word frequency
- multi-label
- information gain
- text documents
- document categorization
- reuters corpus
- text classifiers
- automatic text categorization
- n-gram
- semi-supervised learning
- k-nearest neighbor
- k-means
- clustering method
- document clustering
- unsupervised learning
- information retrieval
- feature selection for text categorization
- tf-idf
- word sense disambiguation
- co-occurrence
- learning algorithm