Fast embedding methods for clustering tens of thousands of sequences.
Gordon BlackshieldsMark A. LarkinIain M. WallaceAndreas WilmDesmond G. HigginsPublished in: Comput. Biol. Chem. (2008)
Keyphrases
- tens of thousands
- clustering algorithm
- k means
- clustering method
- information theoretic
- categorical data
- hierarchical clustering
- high dimensional data
- data clustering
- hidden markov models
- data points
- cluster analysis
- outlier detection
- databases
- temporal patterns
- spectral clustering
- machine learning
- database
- anomaly detection
- real world
- data mining
- social networks