Words as Rules: Feature Selection in Text Categorization.
Elena MontañésElías F. CombarroIrene DíazJosé RanillaJosé Ramón QuevedoPublished in: International Conference on Computational Science (2004)
Keyphrases
- text categorization
- feature selection
- text documents
- distributional clustering
- document frequency
- text classification
- training documents
- automated text categorization
- word frequency
- knn
- multi label
- reuters corpus
- information gain
- k nearest neighbor
- n gram
- text classifiers
- feature weighting
- tf idf
- classification accuracy
- association rules
- semi supervised learning
- feature generation
- machine learning
- naive bayes
- text filtering
- mutual information
- automatic text categorization
- feature selections
- information theoretic
- text collections
- term frequency
- data sets
- document representation
- model selection
- dimensionality reduction
- data analysis
- keywords
- feature selection for text categorization