HSDD: a hybrid sampling strategy for class imbalance in defect prediction data sets.
Muhammed Maruf Öztürk, Ahmet Zengin
Published in: FGCT (2016)
Keyphrases
- sampling strategy
- class imbalance
- sampling methods
- data sets
- defect prediction
- active learning
- class distribution
- cost sensitive
- concept drift
- training set
- original data
- training data
- software projects
- random sampling
- software repositories
- feature selection
- high dimensionality
- high dimensional data
- remote sensing
- sampling algorithm
- learning environment