A Fast Distributed Classification Algorithm for Large-Scale Imbalanced Data.
Huihui WangYang GaoYinghuan ShiHao WangPublished in: ICDM (2016)
Keyphrases
- classification algorithm
- imbalanced data
- support vector machine
- base learners
- training set
- naive bayes
- class distribution
- concept drift
- k nearest neighbor
- knn
- class imbalance
- class labels
- feature selection
- learning algorithm
- decision trees
- classification rules
- random forest
- svm classifier
- sampling methods
- clustering algorithm
- ensemble methods
- high dimensionality
- data analysis
- support vector machine svm
- multi class
- training data
- test set
- training examples
- data mining
- linear regression
- unsupervised learning
- text mining
- classification error
- least squares
- nearest neighbor
- decision boundary
- feature vectors