Feature selection for high-dimensional imbalanced data.
Liuzhi YinYong GeKeli XiaoXuehua WangXiaojun QuanPublished in: Neurocomputing (2013)
Keyphrases
- imbalanced data
- feature selection
- high dimensional
- high dimensionality
- dimensionality reduction
- feature space
- classification models
- support vector machine
- text categorization
- high dimensional data
- feature selection algorithms
- low dimensional
- linear regression
- feature set
- class imbalance
- text classification
- ensemble classifier
- training samples
- decision trees
- sampling methods
- classification accuracy
- class distribution
- random forest
- knn
- nearest neighbor
- ensemble methods
- feature extraction
- preprocessing step
- unsupervised learning
- feature subset
- model selection
- multi class
- svm classifier
- ensemble learning
- least squares
- naive bayes
- support vector machine svm
- support vector
- principal component analysis
- labeled data