An imbalanced data classification method based on automatic clustering under-sampling.
Xiaoheng Deng, Weijian Zhong, Ju Ren, Detian Zeng, Honggang Zhang
Published in: IPCCC (2016)
Keyphrases
- classification method
- imbalanced data
- support vector machine
- clustering algorithm
- k nearest neighbor
- knn
- text classification
- feature selection
- support vector machine svm
- classification algorithm
- ensemble methods
- sampling methods
- decision trees
- unsupervised learning
- high dimensionality
- linear regression
- random forest
- data mining
- data analysis
- class imbalance
- data sets
- active learning
- feature space