Random forest-based prediction of protein sumoylation sites from sequence features.
Shaolei TengHong LuoLiangjiang WangPublished in: BCB (2010)
Keyphrases
- random forest
- feature set
- selected features
- feature importance
- random forests
- decision trees
- ensemble classifier
- prediction accuracy
- protein families
- fold cross validation
- sequence analysis
- feature extraction
- image features
- contact map
- feature vectors
- classification models
- protein structure prediction
- classification accuracy
- feature space
- ensemble methods
- data sets
- cancer classification
- ensemble learning
- face recognition
- class labels
- higher order
- data points