Snorkel: rapid training data creation with weak supervision.
Alexander Ratner, Stephen H. Bach, Henry R. Ehrenberg, Jason A. Fries, Sen Wu, Christopher Ré
Published in: VLDB J. (2020)
Keyphrases
- training data
- test data
- learning algorithm
- training set
- test set
- decision trees
- training examples
- training process
- classification accuracy
- supervised learning
- labeled data
- labeled training data
- training samples
- data sets
- class labels
- training instances
- generalization error
- classification models
- neural network
- naive bayes
- domain knowledge
- multiscale
- data mining
- support vector machine
- prior knowledge
- noisy data
- unlabeled data
- real time
- databases
- training dataset
- classification trees
- small number