Frugal Gaussian clustering of huge imbalanced datasets through a bin-marginal approach.
Filippo AntonazzoChristophe BiernackiChristine KeribinPublished in: Stat. Comput. (2023)
Keyphrases
- imbalanced datasets
- imbalanced class distribution
- clustering algorithm
- learning from imbalanced data
- class distribution
- outlier detection
- cost sensitive learning
- sampling methods
- imbalanced data
- decision trees
- class imbalance
- maximum likelihood
- probability distribution
- training dataset
- data points
- ensemble methods
- cost sensitive
- high dimensionality
- high dimensional data
- unsupervised learning
- data sets