Dataset Pruning: Reducing Training Data by Examining Generalization Influence.
Shuo Yang, Zeke Xie, Hanyu Peng, Min Xu, Mingming Sun, Ping Li
Published in: CoRR (2022)
Keyphrases
- individual differences
- training data
- training dataset
- test data
- training set
- weakly labeled
- learning algorithm
- representative set
- avoid overfitting
- data sets
- decision trees
- prior knowledge
- pruning method
- test set
- genetic algorithm
- classification models
- benchmark datasets
- labeled data
- supervised learning
- classification accuracy
- domain knowledge
- training samples
- semi-supervised learning
- training instances
- pruning methods
- hidden Markov models
- feature extraction