Incorporating topical support documents into a small training set in text categorization.
Kyung Soon LeePublished in: CIKM (2008)
Keyphrases
- text categorization
- document classification
- text documents
- automatic text categorization
- automatic categorization
- text classifiers
- training documents
- document categorization
- text classification
- term frequency
- text collections
- term selection
- multi label
- feature selection
- information gain
- knn
- classify documents
- reuters corpus
- k nearest neighbor
- naive bayes
- term weighting
- distributional clustering
- keywords
- word frequency
- information retrieval
- unlabeled data
- document frequency
- document retrieval
- web documents
- information retrieval systems
- training data
- data mining
- neural network
- data sets
- tf idf
- nearest neighbor
- training set
- metadata