An Evaluation of Phrasal and Clustered Representations on a Text Categorization Task.
David D. LewisPublished in: SIGIR (1992)
Keyphrases
- text categorization
- text classification
- knn
- multi label
- k nearest neighbor
- feature selection
- reuters corpus
- information gain
- naive bayes
- automated text categorization
- feature weighting
- feature selections
- document categorization
- text classifiers
- text collections
- text documents
- unlabeled data
- semi supervised learning
- tf idf
- term frequency
- automatic text categorization
- classification accuracy
- machine learning