Safely selecting subsets of training data.
Dawei YinChang AnHenry S. BairdPublished in: Document Analysis Systems (2010)
Keyphrases
- training data
- decision trees
- test data
- training set
- supervised learning
- data sets
- learning algorithm
- domain knowledge
- case study
- training samples
- classification accuracy
- test set
- training process
- training instances
- search engine
- real time
- prior knowledge
- training dataset
- website
- labeled data
- database
- class labels
- search algorithm
- e learning
- computer vision
- sufficient training data