Sign in

Balancing reducer workload for skewed data using sampling-based partitioning.

Yujie XuWenyu QuZhiyang LiZhaobin LiuChangqing JiYuanyuan LiHaifeng Li
Published in: Comput. Electr. Eng. (2014)
Keyphrases
  • skewed data
  • load balancing
  • streaming data
  • response time
  • class imbalance
  • databases
  • feature selection
  • decision trees
  • relational databases
  • distributed systems