Handling Extreme Class Imbalance in Technical Logbook Datasets.
Farhad AkhbardehCecilia Ovesdotter AlmMarcos ZampieriTravis DesellPublished in: ACL/IJCNLP (1) (2021)
Keyphrases
- class imbalance
- imbalanced datasets
- sampling methods
- class distribution
- imbalanced data
- binary classification problems
- class noise
- active learning
- cost sensitive learning
- cost sensitive
- imbalanced class distribution
- high dimensionality
- concept drift
- class imbalanced
- small disjuncts
- minority class
- majority class
- decision trees
- software defect prediction
- benchmark datasets
- pattern recognition
- feature selection
- training set
- e learning