Snorkel: Rapid Training Data Creation with Weak Supervision.
Alexander Ratner, Stephen H. Bach, Henry R. Ehrenberg, Jason Alan Fries, Sen Wu, Christopher Ré
Published in: CoRR (2017)
Keyphrases
- training data
- decision trees
- prior knowledge
- test data
- learning algorithm
- supervised learning
- data sets
- training set
- support vector machine
- creation process
- domain knowledge
- unlabeled data
- training examples
- test set
- classification models
- databases
- active learning
- database
- information systems
- training samples
- genetic algorithm
- generalization error
- training dataset
- training instances
- learned from training data