Certainty-based active learning for sampling imbalanced datasets.
JuiHsi FuSingLing LeePublished in: Neurocomputing (2013)
Keyphrases
- imbalanced datasets
- imbalanced class distribution
- active learning
- sampling methods
- random sampling
- class imbalance
- rare class
- cost sensitive learning
- imbalanced data
- learning from imbalanced data
- minority class
- class distribution
- majority class
- sampling algorithm
- decision trees
- cost sensitive
- highly skewed
- ensemble methods
- semi supervised
- learning algorithm
- machine learning
- training dataset
- unlabeled data
- sample size
- binary classification
- training examples
- training set
- supervised learning
- data sets
- pairwise
- support vector machine
- fraud detection
- feature selection algorithms
- generalization error
- small number
- transfer learning
- semi supervised learning