Recovering "Lack of Words" in Text Categorization for Item Banks.
Atorn NuntiyagulNick CerconeKanlaya NaruedomkulPublished in: COMPSAC (2) (2005)
Keyphrases
- text categorization
- text documents
- distributional clustering
- training documents
- document frequency
- text classification
- knn
- word frequency
- feature selection
- multi label
- k nearest neighbor
- n gram
- reuters corpus
- information gain
- automated text categorization
- naive bayes
- term frequency
- text classifiers
- text collections
- document clustering
- information theoretic
- term weighting
- feature generation
- document categorization
- semi supervised learning
- keywords
- word sense disambiguation
- tf idf
- similarity measure