Training data selection for imbalanced cross-project defect prediction.
Shang Zheng, Jinjing Gai, Hualong Yu, Haitao Zou, Shang Gao
Published in: Comput. Electr. Eng. (2021)
Keyphrases
- training data
- defect prediction
- software projects
- data sets
- decision trees
- class distribution
- training set
- classification accuracy
- project management
- test data
- learning algorithm
- training samples
- test set
- software repositories
- training examples
- software development
- prior knowledge
- training process
- class imbalance
- case study
- classification models
- knowledge discovery
- software maintenance
- imbalanced data
- imbalanced datasets
- real world