Active Clustering of Biological Sequences.
Konstantin VoevodskiMaria-Florina BalcanHeiko RöglinShang-Hua TengYu XiaPublished in: J. Mach. Learn. Res. (2012)
Keyphrases
- biological sequences
- k means
- clustering algorithm
- self organizing maps
- clustering method
- sequence data
- molecular biology
- protein sequences
- motif finding
- biological data
- computational biology
- data structure
- data points
- cluster analysis
- database
- unsupervised learning
- semi supervised
- high dimensional data
- statistically significant
- similarity function
- dna sequences
- dynamic programming
- database systems
- databases