Text Categorization Based on Subtopic Clusters.
Francis C. Y. ChikRobert Wing Pong LukKorris Fu-Lai ChungPublished in: NLDB (2005)
Keyphrases
- text categorization
- feature selection
- knn
- multi label
- text classification
- clustering algorithm
- reuters corpus
- k nearest neighbor
- unlabeled data
- information gain
- automated text categorization
- automatic text categorization
- document set
- text documents
- naive bayes
- machine learning
- data points
- test collection
- tf idf
- term frequency
- document clustering
- text collections
- semi supervised learning
- labeled data
- multi instance multi label learning
- feature selections