Balanced training/test set sampling for proper evaluation of classification models.
Donghoon KangSejong OhPublished in: Intell. Data Anal. (2020)
Keyphrases
- test set
- classification models
- training data
- training set
- evaluation methodology
- error rate
- test data
- imbalanced data
- decision trees
- software quality classification
- training and test data
- feature selection
- feature set
- classification accuracy
- class distribution
- training examples
- training samples
- training and test sets
- data sets
- evolutionary algorithm
- training process
- models built
- fitness function
- attribute selection
- evaluation methods
- feature subset
- unlabeled data
- machine learning