Optimized hybrid imbalanced data sampling for decision tree training.
Weronika WegierMichal KoziarskiMichal WozniakPublished in: GECCO Companion (2023)
Keyphrases
- imbalanced data
- decision trees
- random forest
- classification models
- ensemble methods
- training set
- learning from imbalanced data
- sampling methods
- class distribution
- ensemble classifier
- training data
- test set
- imbalanced datasets
- training process
- logistic regression
- class imbalance
- base classifiers
- linear regression
- support vector machine
- feature selection
- random forests
- neural network
- machine learning algorithms
- naive bayes
- machine learning
- training examples
- supervised learning
- attribute selection
- decision rules
- multi class
- active learning
- imbalanced class distribution