Revisiting two-stage feature selection based on coverage policies for text classification.
Arquímides Méndez-MolinaAna Li Oña-GarcíaJesús Ariel Carrasco-OchoaJosé Fco. Martínez-TrinidadPublished in: J. Intell. Fuzzy Syst. (2018)
Keyphrases
- text classification
- feature selection
- text categorization
- naive bayes
- feature engineering
- web page classification
- bag of words
- n gram
- optimal policy
- labeled data
- classification accuracy
- knn
- text mining
- text data
- information gain
- text documents
- support vector
- multi label
- mutual information
- data cleaning
- text classifiers
- unlabeled data
- unsupervised learning
- machine learning
- feature space
- supervised feature selection
- text classification tasks
- feature selection algorithms
- feature set
- model selection
- feature subset
- natural language
- training data
- decision trees