Expand your Training Limits! Generating Training Data for ML-based Data Management.
Francesco VenturaZoi KaoudiJorge-Arnulfo Quiané-RuizVolker MarklPublished in: SIGMOD Conference (2021)
Keyphrases
- data management
- training data
- training process
- training set
- training examples
- test set
- training dataset
- supervised learning
- maximum likelihood
- labelled data
- training samples
- learning algorithm
- database management systems
- labeled data for training
- training patterns
- labeled training data
- data sets
- training and testing data
- training and test data
- decision trees
- database
- avoid overfitting
- test data
- training corpus
- training algorithm
- databases
- sample selection
- database systems
- query processing
- class labels
- classification accuracy
- online learning
- neural network
- data integration
- active learning
- domain knowledge
- support vector machine