Automatic Category Theme Identification and Hierarchy Generation for Chinese Text Categorization.
Hsin-Chang YangChung-Hong LeePublished in: J. Intell. Inf. Syst. (2005)
Keyphrases
- text categorization
- text classification
- training documents
- knn
- multi label
- feature selection
- classify documents
- k nearest neighbor
- text documents
- text classifiers
- information gain
- reuters corpus
- naive bayes
- document categorization
- automated text categorization
- feature selection for text categorization
- automatic text categorization
- semi supervised learning
- feature weighting
- tf idf
- term frequency
- feature selections
- unlabeled data
- text collections
- support vector machine
- feature extraction
- data sets
- n gram
- nearest neighbor
- document frequency
- reinforcement learning
- information retrieval