t-Test feature selection approach based on term frequency for text categorization.
Deqing WangHui ZhangRui LiuWeifeng LvDatao WangPublished in: Pattern Recognit. Lett. (2014)
Keyphrases
- text categorization
- term frequency
- feature selection
- document frequency
- text classification
- term weighting
- tf idf
- automatic text categorization
- information gain
- text documents
- knn
- naive bayes
- text classifiers
- k nearest neighbor
- semi supervised learning
- feature set
- mutual information
- support vector machine
- retrieved documents
- feature extraction
- unlabeled data
- co occurrence
- classification accuracy
- feature space
- neural network