Impact of Feature Selection on Micro-Text Classification.
Ankit Vadehra, Maura R. Grossman, Gordon V. Cormack
Published in: CoRR (2017)
Keyphrases
- text classification
- feature selection
- text categorization
- bag of words
- machine learning
- labeled data
- text mining
- naive bayes
- data cleaning
- text classifiers
- n gram
- feature engineering
- text data
- sentiment analysis
- feature reduction
- text documents
- support vector machine
- web page classification
- feature weighting
- semantic features
- information gain
- mutual information
- support vector
- classification accuracy
- irrelevant features
- unlabeled data
- unsupervised learning
- feature selection algorithms
- feature extraction
- knn
- model selection
- k nearest neighbor
- feature subset
- high dimensionality
- multi task
- semi supervised learning
- microarray data
- information retrieval
- multi label
- databases
- multi class