Discovering Minority Sub-clusters and Local Difficulty Factors from Imbalanced Data.
Mateusz LangoDariusz BrzezinskiSebastian FirlikJerzy StefanowskiPublished in: DS (2017)
Keyphrases
- imbalanced data
- class distribution
- majority class
- minority class
- highly imbalanced
- clustering algorithm
- class imbalance
- linear regression
- imbalanced class distribution
- feature selection
- imbalanced datasets
- class imbalanced
- support vector machine
- classification models
- random forest
- data points
- decision trees
- decision boundary
- sampling methods
- ensemble classifier
- training data
- cost sensitive
- data distribution
- feature set
- misclassification costs
- ensemble methods
- cost sensitive learning
- least squares
- data sets
- feature space
- nearest neighbour
- training samples
- svm classifier
- high dimensionality