ProtoNet 6.0: organizing 10 million protein sequences in a compact hierarchical family tree.
Nadav RappoportSolange KarsentyAmos SternNathan LinialMichal LinialPublished in: Nucleic Acids Res. (2012)
Keyphrases
- protein sequences
- computational biology
- protein structure
- amino acids
- tree structure
- biological sequences
- protein families
- amino acid sequences
- sequence analysis
- secondary structure
- sequence databases
- multiple sequence alignment
- protein structure and function
- multiple alignment
- protein classification
- protein function
- r tree
- protein secondary structure
- protein structure prediction
- spanning tree
- statistically significant
- index structure
- machine learning