Large scale clustering of protein sequences with FORCE -A layout based heuristic for weighted cluster editing.
Tobias WittkopJan BaumbachFrancisco Pereira LoboSven RahmannPublished in: BMC Bioinform. (2007)
Keyphrases
- protein sequences
- clustering algorithm
- data clustering
- computational biology
- k means
- cluster analysis
- amino acids
- data points
- multiple sequence alignments
- cluster centers
- protein structure
- clustering method
- protein structure and function
- protein classification
- secondary structure
- multiple sequence alignment
- intra cluster
- biological sequences
- sequence databases
- fuzzy clustering
- multiple alignment
- information theoretic
- protein function
- protein secondary structure
- clustering analysis
- protein structure prediction
- amino acid sequences
- self organizing maps
- protein structural
- document clustering