Load balancing in reducers for skewed data in MapReduce systems by using scalable simple random sampling.
Elaheh GavagsazAli RezaeeHamid Haj Seyyed JavadiPublished in: J. Supercomput. (2018)
Keyphrases
- load balancing
- skewed data
- random sampling
- distributed systems
- active learning
- data skew
- sampling algorithm
- grid computing
- data intensive
- peer to peer
- mobile agents
- sample size
- distributed computing
- class imbalance
- parallel processing
- data structure
- reinforcement learning
- web services
- sampling methods
- feature selection
- data sets