A comprehensive comparative study on term weighting schemes for text categorization with support vector machines.
Man LanChew Lim TanHwee-Boon LowSam Yuan SungPublished in: WWW (Special interest tracks and posters) (2005)
Keyphrases
- text categorization
- comparative study
- term weighting schemes
- tf idf
- term weighting
- term frequency
- feature selection
- weighting scheme
- text classification
- precision recall
- information retrieval
- k nearest neighbor
- knn
- text documents
- term weights
- vector space model
- semi supervised learning
- unlabeled data
- test collection
- weighting schemes
- cross domain
- labeled data
- learning algorithm
- neural network