Variable selection from a feature representing protein sequences: a case of classification on bacterial type IV secreted effectors.
Jian ZhangLixin LvDonglei LuDenan KongMohammed Abdoh Ali Al-AlashaariXudong ZhaoPublished in: BMC Bioinform. (2020)
Keyphrases
- variable selection
- protein sequences
- cross validation
- computational biology
- protein classification
- model selection
- dimension reduction
- input variables
- feature vectors
- feature selection
- high dimensional
- protein structure
- amino acids
- preprocessing
- classification accuracy
- feature set
- machine learning
- image classification
- feature extraction
- text classification
- unsupervised learning
- support vector machine svm
- support vector machine
- training set
- decision trees
- secondary structure
- protein function
- supervised learning
- high dimensionality
- feature space
- pattern recognition
- support vector
- genetic algorithm