Simplifying Access to Large-scale Structured Datasets by Meta-Profiling with Scalable Training Set Enrichment.
Sophie PaviaRituparna KhanAnna PyaytMichael N. GubanovPublished in: SIGMOD Conference (2022)
Keyphrases
- training set
- web scale
- training dataset
- training data
- data sets
- test data
- test set
- massive scale
- scientific data analysis
- small scale
- structured data
- benchmark datasets
- supervised learning
- nearest neighbor
- access control
- classification accuracy
- real world
- cross validation
- active learning
- meta level
- random access
- million images
- database
- training examples
- class distribution
- highly scalable
- feature space
- trained classifiers