SVM Learning from Imbalanced Data by GA Sampling for Protein Domain Prediction.
Shu-Xue Zou, Yanxin Huang, Yan Wang, Jianxin Wang, Chunguang Zhou
Published in: ICYCS (2008)
Keyphrases
- imbalanced data
- learning from imbalanced data
- imbalanced datasets
- genetic algorithm
- prediction accuracy
- genetic algorithm (GA)
- support vector
- sampling methods
- support vector machine (SVM)
- support vector machine
- ensemble methods
- kNN
- feature selection
- class distribution
- random forest
- machine learning
- protein sequences
- ensemble classifier
- SVM classifier
- training data
- simulated annealing
- protein structure
- neural network
- random sampling
- fitness function
- training dataset
- decision boundary
- multi class