Predictive Modeling of ICU Healthcare-Associated Infections from Imbalanced Data. Using Ensembles and a Clustering-Based Undersampling Approach.
Fernando Sánchez HernándezJuan Carlos Ballesteros HerráezMohamed S. KraiemMercedes Sánchez BarbaMaría N. Moreno GarcíaPublished in: CoRR (2020)
Keyphrases
- imbalanced data
- predictive modeling
- class imbalance
- majority class
- class distribution
- statistical modeling
- minority class
- data mining methods
- ensemble methods
- sampling methods
- active learning
- cost sensitive
- data analysis
- machine learning
- data mining
- knowledge discovery
- feature selection
- text mining
- cost sensitive learning
- base classifiers
- high dimensionality
- decision trees
- linear regression
- business intelligence
- data mining techniques
- random forest
- support vector machine
- ensemble classifier
- information retrieval
- benchmark datasets
- training set
- generalization ability
- least squares
- big data
- original data