Login / Signup
Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences.
Weizhong Li
Adam Godzik
Published in:
Bioinform. (2006)
Keyphrases
</>
nucleotide sequences
sequence data
molecular biology
protein sequences
dna sequences
clustering algorithm
k means
clustering method
computational biology
protein structure
genome scale
amino acids
information theoretic
sequence similarity
high dimensional data
systems biology
protein folding