The performance evaluation of k-means by two MapReduce frameworks, Hadoop vs. Twister.
Yunhee KangYoung B. ParkPublished in: ICOIN (2015)
Keyphrases
- k means
- cloud computing
- mapreduce framework
- map reduce
- data analytics
- distributed computing
- clustering algorithm
- data intensive
- data clustering
- open source
- big data
- self organizing maps
- clustering method
- distributed systems
- cloud computing platform
- spectral clustering
- large scale data sets
- cluster analysis
- distributed processing
- data management
- high performance data mining
- commodity hardware
- unsupervised clustering
- hierarchical clustering
- clustering approaches
- frequent itemset mining
- fuzzy k means
- initial cluster centers
- expectation maximization
- parallel computation
- text clustering
- similarity measure
- cluster centers
- case study
- real world
- data analysis