Characterizing and benchmarking stand-alone Hadoop MapReduce on modern HPC clusters.
Dipti ShankarXiaoyi LuMd. Wasi-ur-RahmanNusrat S. IslamDhabaleswar K. PandaPublished in: J. Supercomput. (2016)
Keyphrases
- cloud computing
- mapreduce framework
- distributed computing
- map reduce
- clustering algorithm
- fault tolerance
- computing infrastructure
- open source
- high performance computing
- data intensive
- data analytics
- cluster analysis
- hierarchical clustering
- computing resources
- big data
- high performance data mining
- distributed systems
- fuzzy clustering
- fuzzy c means
- data sets
- parallel processing
- intra cluster
- k means
- fault tolerant
- user friendly