Rare events and imbalanced datasets: an overview.
Maher MaaloufTheodore B. TrafalisPublished in: Int. J. Data Min. Model. Manag. (2011)
Keyphrases
- rare events
- imbalanced datasets
- class imbalance
- class distribution
- minority class
- fraud detection
- cost sensitive learning
- active learning
- cost sensitive
- imbalanced data
- sampling methods
- training data
- test set
- misclassification costs
- training set
- classification error
- decision boundary
- support vector machine
- concept drift
- unlabeled data
- data mining
- data mining techniques
- test data
- outlier detection
- class labels
- ensemble methods
- training dataset
- high dimensionality
- feature selection
- semi supervised learning
- importance sampling
- training examples
- error rate