Downsampling for Binary Classification with a Highly Imbalanced Dataset Using Active Learning.
Wonjae Lee, Kangwon Seo
Published in: Big Data Res. (2022)
Keyphrases
- binary classification
- imbalanced datasets
- active learning
- class imbalance
- cost sensitive learning
- cost sensitive
- active learning strategies
- rare class
- generalization error
- class distribution
- ensemble methods
- multi class classification
- multi class
- support vector
- misclassification costs
- binary classifiers
- sampling methods
- training examples
- prediction accuracy
- semi supervised
- training set
- random sampling
- unlabeled data
- minority class
- support vector machine
- learning process
- supervised learning
- learning problems
- multi label
- learning algorithm
- semi supervised learning
- feature extraction
- decision trees
- machine learning
- training dataset
- machine learning methods
- class labels
- training data