An extensive empirical study of inconsistent labels in multi-version-project defect data sets.
Shiran Liu, Zhaoqiang Guo, Yanhui Li, Chuanqi Wang, Lin Chen, Zhongbin Sun, Yuming Zhou
Published in: CoRR (2021)
Keyphrases
- data sets
- training data
- training set
- pairwise
- data streams
- european project
- real world
- case study
- real world data sets
- multi label
- nsf funded
- unseen data
- defect detection
- learning community
- benchmark data sets
- software engineering
- synthetic data
- data collection
- software development
- current status
- data points
- neural network
- database