Best terms: an efficient feature-selection algorithm for text categorization.
Dimitris Fragoudis, Dimitris Meretakis, Spiridon D. Likothanassis
Published in: Knowl. Inf. Syst. (2005)
Keyphrases
- text categorization
- feature selection
- feature selection algorithms
- term frequency
- text classification
- term selection
- multi label
- term weighting
- knn
- text documents
- document frequency
- k nearest neighbor
- machine learning
- tf idf
- naive bayes
- mutual information
- support vector
- feature weighting
- automatic text categorization
- semi supervised learning
- data sets
- unsupervised learning
- feature extraction
- genetic algorithm
- dimensionality reduction
- knowledge discovery
- pattern recognition
- feature selection for text categorization