Why ADAM Beats SGD for Attention Models.
Jingzhao Zhang
Sai Praneeth Karimireddy
Andreas Veit
Seungyeon Kim
Sashank J. Reddi
Sanjiv Kumar
Suvrit Sra
Published in:
CoRR (2019)
Keyphrases
complex systems
statistical models
learning models
process model
neural network
search engine
prior knowledge
hidden markov models
statistical model
statistical methods
accurate models