Similarity-Based Node Distance Exploring and Locality-Aware Shuffle Optimization for Hadoop MapReduce.
Jihe WangDanghui WangMeng ZhangMeikang QiuBing GuoPublished in: SmartCloud (2017)
Keyphrases
- cloud computing
- open source
- mapreduce framework
- optimization problems
- path length
- distributed computing
- map reduce
- optimization algorithm
- global optimization
- data intensive
- distance measure
- real world
- data analytics
- hamming distance
- optimization process
- optimization method
- tree structure
- distributed systems
- big data
- distance function
- nearest neighbor
- case study
- neural network
- shortest distance