Class Imbalance Data-Generation for Software Defect Prediction.
Zheng LiXingyao ZhangJunxia GuoYing ShangPublished in: APSEC (2019)
Keyphrases
- data generation
- software defect prediction
- class imbalance
- active learning
- concept drift
- streaming data
- class distribution
- data streams
- cost sensitive learning
- cost sensitive
- learning algorithm
- training examples
- machine learning
- sampling methods
- labeled data
- supervised learning
- ensemble learning
- semi supervised
- feature selection
- random sampling
- imbalanced data
- training set
- feature ranking
- misclassification costs
- semi supervised learning
- high dimensionality
- co training
- learning environment
- minority class
- multi class
- unlabeled data