Text Categorization Using Distributional Clustering and Concept Extraction.
Yifan He, Minghu Jiang
Published in: ICIC (1) (2007)
Keyphrases
- text categorization
- distributional clustering
- information theoretic
- text classification
- feature selection
- knn
- k nearest neighbor
- semi supervised learning
- multi label
- reuters corpus
- information gain
- mutual information
- feature weighting
- automatic text categorization
- document categorization
- automated text categorization
- tf idf
- text documents
- naive bayes
- feature selections
- term frequency
- semi supervised
- decision trees