EdtClust: A fast homologous protein sequences clustering method based on edit distance.
Yixin XiangJiachang GuJianyu ZhouPublished in: BIBM (2023)
Keyphrases
- clustering method
- edit distance
- protein sequences
- similarity measure
- sequence alignment
- computational biology
- graph matching
- string matching
- protein structure
- amino acids
- biological sequences
- cluster analysis
- secondary structure
- clustering algorithm
- spectral clustering
- distance function
- dissimilarity measure
- document clustering
- distance measure
- k means
- dynamic programming
- euclidean distance
- graph clustering
- string kernels
- sequence databases
- pattern matching
- data mining techniques
- molecular biology
- object recognition
- machine learning