Improving Cross-Project Defect Prediction Methods with Data Simplification.
Sousuke AmasakiKazuya KawataTomoyuki YokogawaPublished in: EUROMICRO-SEAA (2015)
Keyphrases
- data sets
- statistical methods
- high quality
- noisy data
- data structure
- database
- data processing
- data mining methods
- high dimensional data
- input data
- image data
- knowledge discovery
- data analysis
- data mining techniques
- spectral clustering
- raw data
- data collection
- human experts
- cooperative
- missing values
- data distribution
- defect prediction
- information systems
- case study
- missing data
- synthetic data
- computer systems
- preprocessing
- significant improvement
- original data
- multiresolution
- xml documents
- statistical tests
- data sources