Automatic Cross-Replica Sharding of Weight Update in Data-Parallel Training.
Yuanzhong Xu
HyoukJoong Lee
Dehao Chen
Hongjun Choi
Blake A. Hechtman
Shibo Wang
Published in:
CoRR (2020)
Keyphrases
data processing
training data
multiscale
end users