An efficient method to determine sample size in oversampling based on classification complexity for imbalanced data.
Dohyun Lee, Kyoungok Kim
Published in: Expert Syst. Appl. (2021)
Keyphrases
- sample size
- model selection
- computational complexity
- classification accuracy
- classification algorithm
- support vector machine
- cross validation
- imbalanced data
- objective function
- pattern classification
- machine learning methods
- training samples
- text classification
- linear regression
- decision rules
- support vector machine svm
- machine learning
- feature set
- worst case
- supervised learning
- upper bound
- data points
- pairwise
- decision trees