Gene identification and protein classification in microbial metagenomic sequence data via incremental clustering.
Shibu YoosephWeizhong LiGranger G. SuttonPublished in: BMC Bioinform. (2008)
Keyphrases
- sequence data
- incremental clustering
- protein classification
- protein sequences
- essential genes
- hierarchical clustering
- sequence analysis
- sequence alignment
- binding sites
- concept drift
- clustering algorithm
- biological sequences
- kernel methods
- string kernels
- gene expression
- computational biology
- gene expression data
- microarray
- machine learning
- gene ontology
- high throughput