Bagging Using Instance-Level Difficulty for Multi-Class Imbalanced Big Data Classification on Spark.
William C. Sleeman IV, Bartosz Krawczyk
Published in: IEEE BigData (2019)
Keyphrases
- big data
- class imbalanced
- instance level
- decision trees
- imbalanced data
- cloud computing
- machine learning
- training set
- classification accuracy
- data management
- pattern classification
- benchmark datasets
- feature selection
- ensemble methods
- feature extraction
- machine learning algorithms
- class labels
- class imbalance
- machine learning methods
- data analysis
- learning algorithm
- cross validation
- data processing
- feature set
- database systems
- supervised learning
- ensemble learning
- class distribution
- support vector machine
- knowledge discovery
- databases