Handling data skew at reduce stage in Spark by ReducePartition.

Wenxia Guo, Chaojie Huang, Wenhong Tian
Published in: Concurr. Comput. Pract. Exp. (2020)
Keyphrases
  • data skew
  • data distribution
  • load balancing
  • parallel database systems
  • databases
  • semi-structured