Predicting antifreeze proteins with weighted generalized dipeptide composition and multi-regression feature selection ensemble.
Shunfang WangLin DengXinnan XiaZicheng CaoYu FeiPublished in: BMC Bioinform. (2021)
Keyphrases
- feature selection
- regression problems
- model selection
- supervised feature selection
- ensemble learning
- support vector machines for classification
- text categorization
- mining concept drifting data streams
- regression model
- feature set
- wrapper feature selection
- feature ranking
- predicting protein
- support vector
- cross validation
- microarray data
- protein protein interactions
- random forests
- training set
- random forest
- training data
- ensemble feature selection
- linear regression
- protein sequences
- multi class
- feature space
- mutual information
- knn
- ensemble classification
- support vector machine
- ensemble classifier
- dimensionality reduction
- text classification
- multi task
- genetic programming
- protein structure
- learning algorithm
- ensemble methods
- neural network
- machine learning
- selected features
- decision trees
- subcellular localization
- classification accuracy
- ensemble pruning
- real valued functions
- classifier ensemble
- high dimensionality
- gene selection
- feature selection algorithms
- information gain
- logistic regression
- feature subset