Exploiting Extremely Rare Features in Text Categorization.
Péter SchönhofenAndrás A. BenczúrPublished in: ECML (2006)
Keyphrases
- text categorization
- feature generation
- feature weighting
- information gain
- feature selection
- feature reduction
- text classification
- knn
- multi label
- linear svm
- k nearest neighbor
- training documents
- text documents
- semi supervised learning
- automated text categorization
- feature space
- feature set
- image features
- classification accuracy
- feature extraction
- machine learning
- feature selection for text categorization
- naive bayes
- co occurrence
- knowledge discovery
- support vector
- training data
- term frequency
- text classifiers
- automatic text categorization
- reuters corpus
- object recognition