Login / Signup
GShard: Scaling Giant Models with Conditional Computation and Automatic Sharding.
Dmitry Lepikhin
HyoukJoong Lee
Yuanzhong Xu
Dehao Chen
Orhan Firat
Yanping Huang
Maxim Krikun
Noam Shazeer
Zhifeng Chen
Published in:
CoRR (2020)
Keyphrases
</>
database
statistical models
evolutionary algorithm
complex systems
data mining
bayesian networks
fully automatic
random fields
efficient computation
modeling framework