On the power and limits of sequence similarity based clustering of proteins into families.
Christian Wiwie, Richard Röttger
Published in: PSB (2017)
Keyphrases
- clustering algorithm
- k-means
- protein families
- data clustering
- sequence similarity
- clustering method
- unsupervised learning
- protein structure
- sequence analysis
- information theoretic
- power system
- cluster analysis
- hierarchical clustering
- outlier detection
- document clustering
- computational methods
- protein-protein interactions
- protein structure prediction
- amino acid sequences
- feature space
- data sets