Login / Signup
GShard: Scaling Giant Models with Conditional Computation and Automatic Sharding.
Dmitry Lepikhin
HyoukJoong Lee
Yuanzhong Xu
Dehao Chen
Orhan Firat
Yanping Huang
Maxim Krikun
Noam Shazeer
Zhifeng Chen
Published in:
ICLR (2021)
Keyphrases
</>
probabilistic model
statistical models
statistical model
modeling framework
databases
computer vision
information systems
decision making
database
database systems
complex systems
data driven
evolutionary algorithm
training data
e learning
neural network
data sets