Statistical Approach for Term Weighting in Very Short Documents for Text Categorization.
Mika Timonen, Melissa Kasari
Published in: IC3K (2012)
Keyphrases
- text categorization
- term weighting
- term frequency
- text documents
- term weights
- document categorization
- feature selection
- text classification
- kNN
- tf-idf
- k-nearest neighbor
- information retrieval
- semi-supervised learning
- naive Bayes
- information gain
- term weighting methods
- unlabeled data
- nearest neighbor
- machine learning
- language modeling
- decision trees
- document frequency
- neural network