Learning When Training Data are Costly: The Effect of Class Distribution on Tree Induction
Foster J. ProvostGary M. WeissPublished in: CoRR (2011)
Keyphrases
- training data
- class distribution
- learning algorithm
- prior knowledge
- supervised learning
- training set
- test set
- decision trees
- training samples
- learning curve
- generalization error
- learning problems
- test data
- data sets
- unsupervised learning
- training examples
- class labels
- learning tasks
- active learning
- learning process
- learning scheme
- data mining