Rapid detection, classification and accurate alignment of up to a million or more related protein sequences.
Andrew F. NeuwaldPublished in: Bioinform. (2009)
Keyphrases
- protein sequences
- protein classification
- multiple sequence alignment
- multiple alignment
- remote homology detection
- sequence alignment
- computational biology
- amino acids
- accurate classification
- secondary structure
- classification accuracy
- feature selection
- protein structure
- multiple sequence alignments
- training set
- decision trees
- support vector
- biological sequences
- protein structure prediction
- feature space
- sequence analysis
- machine learning
- support vector machine
- text classification
- protein function
- pairwise
- feature vectors