Load Balancing Approach for a MapReduce Job Running on a Heterogeneous Hadoop Cluster.
Kamalakant Laxman Bawankule, Rupesh Kumar Dewang, Anil Kumar Singh
Published in: ICDCIT (2021)
Keyphrases
- data access
- load balancing
- mapreduce framework
- cloud computing
- data management
- data intensive
- grid environment
- job scheduling
- distributed systems
- cloud computing environment
- parallel database systems
- dynamic load balancing
- distributed computing
- load distribution
- fault tolerance
- fault tolerant
- mobile agents
- large scale data sets
- map reduce
- computing resources
- grid computing
- load balance
- resource utilization
- peer to peer
- big data
- data analytics
- open source
- data replication
- replication scheme
- data skew
- peer to peer systems
- computing platform
- data points
- cloud computing platform
- low overhead
- loosely coupled
- skewed data
- hierarchical clustering
- scheduling problem