SubFeat: Feature subspacing ensemble classifier for function prediction of DNA, RNA and protein sequences.
H. M. Fazlul HaqueRafsanjani MuhammodFariha ArifinSheikh AdilinaSwakkhar ShatabdaPublished in: Comput. Biol. Chem. (2021)
Keyphrases
- protein sequences
- nucleotide sequences
- ensemble classifier
- rna sequences
- sequence analysis
- genome sequences
- secondary structure
- protein secondary structure
- biological sequences
- protein classification
- structural motifs
- computational biology
- protein structure prediction
- multiple alignment
- protein structure
- multiple sequence alignments
- preprocessing step
- sequence databases
- ensemble learning
- prediction accuracy
- molecular biology
- amino acids
- concept drift
- random forest
- ensemble methods
- support vector machine
- multiple sequence alignment
- sequence data
- image features
- feature selection
- dna sequences
- feature vectors
- remote homology detection
- experimentally determined
- amino acid sequences
- classification models
- sequence alignment
- base classifiers
- feature set
- decision trees
- protein secondary structure prediction
- genomic sequences
- genome wide
- generalization ability