A new sampling approach for classification of imbalanced data sets with high density.
Pengfei Jia, Chunkai Zhang, Zhenyu He
Published in: BigComp (2014)
Keyphrases
- high density
- imbalanced data sets
- imbalanced data
- low density
- minority class
- class imbalance
- benchmark data sets
- decision trees
- imbalanced class distribution
- ROC curve
- concept learning
- pattern classification
- machine learning algorithms
- classification accuracy
- classification algorithm
- data center
- support vector
- supervised learning
- class distribution
- feature vectors
- rare events
- decision rules
- classification models
- feature space
- sampling methods
- machine learning
- support vector machine (SVM)
- feature extraction
- high dimensional
- nearest neighbour
- active learning
- text classification
- training samples
- class labels
- benchmark datasets