Training data selection based on dataset distillation for rapid deployment in machine-learning workflows.
Yuna Jeong, Myunggwon Hwang, Won-Kyung Sung
Published in: Multim. Tools Appl. (2023)
Keyphrases
- machine learning
- training data
- training dataset
- learning algorithm
- decision trees
- supervised learning
- machine learning algorithms
- support vector machine
- training set
- data processing
- machine learning methods
- test data
- benchmark datasets
- training examples
- artificial intelligence
- data sets
- business processes
- unlabeled data
- text classification
- natural language processing
- classification accuracy
- active learning
- test set
- learned from training data
- sample selection
- representative set
- training process
- classification models
- learning tasks
- computer vision
- business process
- information extraction
- feature selection
- pattern recognition
- training samples
- model selection
- input data
- feature set
- web services
- data analysis
- neural network
- kernel methods
- prior knowledge
- generalization error
- semi-supervised learning
- database