Can k-NN imputation improve the performance of C4.5 with small software project data sets? A comparative evaluation.
Qinbao SongMartin J. ShepperdXiangru ChenJun LiuPublished in: J. Syst. Softw. (2008)
Keyphrases
- comparative evaluation
- knn
- software projects
- k nearest neighbor
- data sets
- nearest neighbor
- k nearest neighbour
- mixed data
- software development
- text categorization
- source code
- similarity search
- software engineering
- high dimensional data
- effort estimation
- voting methods
- software project management
- neural network
- missing values
- feature selection
- software quality
- distance function
- web document classification
- nearest neighbour
- training set
- machine learning
- real world
- software development effort