Login / Signup

Sampling-Based Partitioning in MapReduce for Skewed Data.

Yujie XuPeng ZouWenyu QuZhiyang LiKeqiu LiXiaoli Cui
Published in: ChinaGrid (2012)
Keyphrases
  • skewed data
  • load balancing
  • class imbalance
  • cloud computing
  • streaming data
  • distributed computing
  • parallel processing
  • data streams
  • feature vectors
  • active learning
  • distributed systems
  • peer to peer
  • grid computing