A novel feature selection technique for enhancing performance of unbalanced text classification problem.
Santosh Kumar BeheraRajashree DashPublished in: Intell. Decis. Technol. (2022)
Keyphrases
- text classification
- feature selection
- text categorization
- bag of words
- naive bayes
- text data
- multi label
- web page classification
- feature weighting
- text classifiers
- sentiment analysis
- text documents
- text mining
- labeled data
- information gain
- feature selection algorithms
- supervised feature selection
- machine learning
- support vector
- support vector machine
- k nearest neighbor
- mutual information
- multi class
- selected features
- classification accuracy
- knn
- feature engineering
- feature reduction
- irrelevant features
- bayes classifier
- term frequency
- high dimensionality
- dimensionality reduction
- n gram
- unsupervised learning
- feature subset
- microarray data
- feature space
- data cleaning
- model selection
- feature extraction
- data integration
- data mining