Self-paced Ensemble for Highly Imbalanced Massive Data Classification.
Zhining LiuWei CaoZhifeng GaoJiang BianHechang ChenYi ChangTie-Yan LiuPublished in: CoRR (2019)
Keyphrases
- massive data
- highly imbalanced
- classification accuracy
- feature selection
- feature extraction
- machine learning
- decision trees
- data mining applications
- support vector
- text classification
- training set
- feature space
- class distribution
- imbalanced data
- class labels
- class imbalance
- supervised learning
- classification models
- ensemble classifier
- support vector machine svm
- data sets
- preprocessing
- training data
- databases
- big data
- training samples
- random forest
- support vector machine
- database systems
- learning algorithm