Using ensemble methods to deal with imbalanced data in predicting protein-protein interactions.
Yongqing ZhangDanling ZhangGang MiDaichuan MaGongbing LiYanzhi GuoMenglong LiMin ZhuPublished in: Comput. Biol. Chem. (2012)
Keyphrases
- imbalanced data
- ensemble methods
- prediction accuracy
- random forests
- decision trees
- benchmark datasets
- ensemble learning
- machine learning methods
- random forest
- base classifiers
- linear regression
- generalization ability
- ensemble classifier
- feature selection
- base learners
- class distribution
- sampling methods
- machine learning
- high dimensionality
- support vector machine
- class imbalance