Modeling of class imbalance using an empirical approach with spambase dataset and random forest classification.
Kiranmayi Kotipalli, Shan Suthaharan
Published in: RIIT (2014)
Keyphrases
- class imbalance
- random forest
- fold cross validation
- feature set
- imbalanced data
- decision trees
- class distribution
- random forests
- cost sensitive
- active learning
- imbalanced datasets
- cost sensitive learning
- feature selection
- high dimensionality
- concept drift
- ensemble methods
- benchmark datasets
- ensemble classifier
- feature extraction
- sampling methods
- support vector machine
- classification accuracy
- base classifiers
- feature space
- feature vectors
- class labels
- multi label
- classification models
- data mining
- multi class
- ensemble learning
- nearest neighbor
- image classification
- test set