Document-self expansion for text categorization.
Yuen-Hsien TsengDa-Wei JuangPublished in: SIGIR (2003)
Keyphrases
- text categorization
- document classification
- text documents
- term frequency
- text classifiers
- training documents
- automatic categorization
- document categorization
- text collections
- tf idf
- automatic text categorization
- text classification
- knn
- feature selection
- multi label
- reuters corpus
- document frequency
- term weighting
- information gain
- automated text categorization
- naive bayes
- k nearest neighbor
- information retrieval
- classify documents
- document clustering
- semi supervised learning
- information retrieval systems
- term selection
- unlabeled data
- document representation
- document collections
- relevant documents
- word frequency
- query expansion
- keywords
- neural network
- feature selections