Density Peak clustering of protein sequences associated to a Pfam clan reveals clear similarities and interesting differences with respect to manual family annotation.
Elena Tea Russo, Alessandro Laio, Marco Punta
Published in: BMC Bioinform. (2021)
Keyphrases
- protein sequences
- protein families
- essential genes
- computational biology
- multiple sequence alignment
- amino acids
- manual annotation
- protein classification
- secondary structure
- amino acid sequences
- protein structure
- protein structure and function
- biological sequences
- protein structure prediction
- protein function
- statistically significant
- multiple alignment
- sequence analysis
- self-organizing maps
- protein secondary structure
- amino acid composition
- sequence alignment
- graph-theoretic
- information-theoretic
- sequence similarity
- k-means
- similarity measure