Integrating oversampling and ensemble-based machine learning techniques for an imbalanced dataset in dyslexia screening tests.
Shahriar KaisarAbdullahi ChowdhuryPublished in: ICT Express (2022)
Keyphrases
- imbalanced datasets
- ensemble methods
- class imbalance
- imbalanced data
- machine learning methods
- learning from imbalanced data
- class distribution
- minority class
- ensemble learning
- random forest
- majority class
- sampling methods
- cost sensitive learning
- base classifiers
- rare class
- random forests
- training set
- machine learning
- test data
- prediction accuracy
- decision trees
- neural network
- active learning
- feature selection
- training data
- training dataset
- high dimensionality
- support vector machine
- classification error
- cost sensitive