Choosing Optimal Maintenance Time for Stateless Data-Processing Clusters - A Case Study of Hadoop Cluster.
Zhenyun ZhuangMin ShenHaricharan RamachandraSuja ViswesanPublished in: JSSPP (2016)
Keyphrases
- data processing
- clustering algorithm
- data clustering
- hierarchical clustering
- cluster analysis
- overlapping clusters
- inter cluster
- unsupervised clustering
- model based clustering
- data points
- subspace clustering
- agglomerative hierarchical clustering
- cluster centers
- disjoint clusters
- data analysis
- constrained clustering
- big data
- gene clusters
- open source
- arbitrary shape
- dynamic programming
- fuzzy clustering
- data management
- initial set
- clustering approaches
- clustering framework
- hierarchical structure
- mapreduce framework
- possibilistic clustering
- hierarchical agglomerative clustering
- cluster validity
- clustering quality
- software maintenance
- document clustering
- document clusters
- document corpus
- cluster membership
- similarity matrix
- clustering procedure
- density based clustering
- cluster structure
- clustering result
- fuzzy c means
- worst case
- k means
- case study