A novel feature selection algorithm for text categorization.
Wenqian ShangHoukuan HuangHaibin ZhuYongmin LinYouli QuZhihai WangPublished in: Expert Syst. Appl. (2007)
Keyphrases
- text categorization
- feature selection algorithms
- feature selection
- text classification
- mutual information
- k nearest neighbor
- text documents
- multi label
- support vector machine
- knn
- feature weighting
- naive bayes
- classification accuracy
- data sets
- feature subset
- machine learning
- dimensionality reduction
- feature set
- feature space
- reuters corpus
- database
- support vector
- classification models
- decision trees
- nearest neighbor
- unsupervised learning
- prior knowledge
- training data
- feature extraction
- tf idf
- neural network