CD-HIT Suite: a web server for clustering and comparing biological sequences.
Ying HuangBeifang NiuYing GaoLimin FuWeizhong LiPublished in: Bioinform. (2010)
Keyphrases
- web server
- biological sequences
- website
- clustering algorithm
- k means
- clustering method
- web pages
- protein sequences
- sequence data
- self organizing maps
- molecular biology
- web search engines
- biological data
- unsupervised learning
- end users
- web usage mining
- cluster analysis
- dna sequences
- log files
- binding sites
- data points
- computational biology
- feature selection
- database systems
- web logs
- data analysis
- high throughput
- databases
- web search
- database management systems