Login / Signup
Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation
Yuhuai Wu
Elman Mansimov
Roger B. Grosse
Shun Liao
Jimmy Ba
Published in:
NIPS (2017)
Keyphrases
</>
trust region
dynamic programming
reinforcement learning
cost function
approximation methods
objective function
learning algorithm
artificial neural networks
optimization method
hessian matrix