A Study on Term Weighting for Text Categorization: A Novel Supervised Variant of tf.idf.
Giacomo Domeniconi, Gianluca Moro, Roberto Pasolini, Claudio Sartori
Published in: DATA (2015)
Keyphrases
- text categorization
- tf idf
- term weighting
- term frequency
- feature selection
- inverse document frequency
- text documents
- text classification
- k nearest neighbor
- information retrieval
- vector space model
- information gain
- knn
- semi supervised learning
- term weights
- retrieval model
- text retrieval
- document frequency
- term weighting schemes
- retrieval systems
- feature extraction
- learning algorithm
- image classification
- document clustering
- wordnet