Single-pass and linear-time k-means clustering based on MapReduce.
Saeed Shahrivari, Saeed Jalili
Published in: Inf. Syst. (2016)
Keyphrases
- single pass
- k means
- hierarchical agglomerative
- clustering algorithm
- clustering method
- high performance data mining
- data clustering
- spectral clustering
- hierarchical clustering
- cloud computing
- self organizing maps
- rough k means
- highly parallel
- cluster centers
- parallel processing
- hidden markov random fields
- worst case
- regression forests
- initial cluster centers
- unsupervised clustering
- document clustering
- stream mining
- distributed processing
- cluster analysis
- expectation maximization
- fuzzy k means
- fuzzy clustering
- validity indices
- cluster validity index
- squared euclidean distance
- clustering quality