Rapid detection, classification and accurate alignment of up to a million or more related protein sequences.
Andrew F. NeuwaldPublished in: Bioinform. (2009)
Keyphrases
- protein sequences
- protein classification
- multiple sequence alignment
- remote homology detection
- multiple alignment
- classification accuracy
- computational biology
- sequence alignment
- accurate classification
- protein structure
- support vector machine
- decision trees
- support vector
- support vector machine svm
- amino acids
- secondary structure
- molecular biology
- feature vectors
- high precision
- biological sequences
- text classification
- string kernels
- protein structure prediction
- feature space
- sequence analysis
- amino acid sequences
- multiple sequence alignments
- automated analysis
- feature selection
- rna sequences
- genetic algorithm
- computational methods
- feature set