Dataset Pruning: Reducing Training Data by Examining Generalization Influence.
Shuo Yang, Zeke Xie, Hanyu Peng, Min Xu, Mingming Sun, Ping Li
Published in: ICLR (2023)
Keyphrases
- training data
- training dataset
- test data
- learning algorithm
- search space
- weakly labeled
- data sets
- training process
- training set
- representative set
- supervised learning
- pruning method
- noisy data
- domain knowledge
- feature set
- training samples
- training examples
- benchmark datasets
- decision trees
- avoid overfitting
- real world
- training instances
- learned from training data
- labelled data
- database
- machine learning
- classification models
- classification accuracy
- data structure