The Hybrid Filter Feature Selection Methods for Improving High-Dimensional Text Categorization.
Le Nguyen Hoai NamHo Bao QuocPublished in: Int. J. Uncertain. Fuzziness Knowl. Based Syst. (2017)
Keyphrases
- text categorization
- high dimensional
- feature selection
- knn
- text classification
- multi label
- k nearest neighbor
- semi supervised learning
- low dimensional
- automated text categorization
- reuters corpus
- information gain
- text documents
- nearest neighbor
- similarity search
- naive bayes
- feature weighting
- semantic browsing
- tf idf
- document categorization
- dimensionality reduction
- training data
- text data
- term frequency
- pairwise
- automatic text categorization
- text classifiers
- feature selection for text categorization
- feature selections
- training documents
- unlabeled data
- classification accuracy
- knowledge discovery
- supervised learning
- learning algorithm