Comparative Study of Apache Spark MLlib Clustering Algorithms.
Sasan Harifi, Ebrahim Byagowi, Madjid Khalilian
Published in: DMBD (2017)
Keyphrases
- comparative study
- clustering algorithm
- open source
- open source software
- web server
- data clustering
- fuzzy c-means
- cluster analysis
- clustering method
- k-means
- document clustering
- fuzzy clustering
- density-based clustering
- open source projects
- mailing lists
- unsupervised clustering
- overlapping clusters
- clustering quality
- clustering framework
- incremental clustering
- constrained clustering
- initial cluster centers
- hierarchical clustering
- simultaneous clustering
- clustering validity
- graph clustering
- search engine
- concept drift