Measures of Rule Quality for Feature Selection in Text Categorization.
Elena MontañésJavier FernándezIrene DíazElías F. CombarroJosé RanillaPublished in: IDA (2003)
Keyphrases
- text categorization
- feature selection
- text classification
- naive bayes
- information gain
- text filtering
- reuters corpus
- multi label
- text documents
- automated text categorization
- document classification
- term frequency
- feature generation
- knn
- semi supervised learning
- tf idf
- text classifiers
- document frequency
- automatic text categorization
- k nearest neighbor
- feature selections
- machine learning
- support vector machine
- linear svm
- text collections
- distributional clustering