SOUL: Scala Oversampling and Undersampling Library for imbalance classification.
Néstor RodríguezDavid LópezAlberto FernándezSalvador GarcíaFrancisco HerreraPublished in: SoftwareX (2021)
Keyphrases
- class imbalance
- majority class
- active learning
- class distribution
- cost sensitive
- cost sensitive learning
- imbalanced datasets
- pattern recognition
- minority class
- classification systems
- classification accuracy
- support vector machine svm
- benchmark datasets
- pattern classification
- classification method
- decision rules
- feature selection
- feature space
- machine learning algorithms
- machine learning
- decision trees
- feature extraction
- classification process
- classification rate
- training set
- feature vectors
- high dimensionality
- classification models
- data streams
- classification scheme
- similarity measure
- misclassification costs
- sampling methods
- knn
- support vector machine
- data sets
- text classification