ESPRIT-Forest: Parallel clustering of massive amplicon sequence data in subquadratic time.
Yunpeng Cai, Wei Zheng, Jin Yao, Yujie Yang, Volker Mai, Qi Mao, Yijun Sun
Published in: PLoS Comput. Biol. (2017)
Keyphrases
- sequence data
- sequence analysis
- clustering algorithm
- sequence classification
- profile hidden Markov models
- k-means
- clustering method
- biological sequences
- sequential data
- nucleotide sequences
- cluster analysis
- gene clusters
- data mining tasks
- document clustering
- high dimensional data
- gene expression analysis
- genome sequences
- alternative splicing
- biological information