The ineffectiveness of within-document term frequency in text classification.
W. John Wilbur, Won Kim
Published in: Inf. Retr. (2009)
Keyphrases
- term frequency
- text classification
- text categorization
- bag of words
- text documents
- term weighting
- document frequency
- feature selection
- text data
- naive bayes
- text mining
- text classifiers
- machine learning
- n gram
- k nearest neighbor
- knn
- tf idf
- document representation
- inverse document frequency
- average precision
- retrieved documents
- language modeling
- labeled data
- named entities
- information extraction
- information retrieval