Automatic Orthologous-Protein-Clustering from Multiple Complete-Genomes by the Best Reciprocal BLAST Hits.
Sunshin Kim, Kwang Su Jung, Keun Ho Ryu
Published in: BioDM (2006)
Keyphrases
- sequence alignment
- comparative genomics
- clustering algorithm
- human genome
- sequence analysis
- protein sequences
- clustering method
- regulatory elements
- amino acids
- k-means
- document clustering
- sequence data
- high dimensional
- pairwise
- genome sequences
- cluster analysis
- feature selection
- sequence similarity
- similarity measure