A Comparative Study on Feature Selection in Text Categorization.
Yiming YangJan O. PedersenPublished in: ICML (1997)
Keyphrases
- text categorization
- feature selection
- automated text categorization
- text classification
- multi label
- information gain
- feature weighting
- text filtering
- knn
- naive bayes
- text documents
- feature selections
- reuters corpus
- feature generation
- feature set
- k nearest neighbor
- semi supervised learning
- mutual information
- machine learning
- automatic text categorization
- semantic browsing
- feature extraction
- dimensionality reduction
- term frequency
- feature selection and classifier
- feature selection for text categorization
- tf idf
- document categorization
- document frequency
- linear svm
- text classifiers
- active learning
- model selection
- decision trees
- classification accuracy
- natural language processing
- nearest neighbor