Login / Signup
An Investigation of Model-Free Planning.
Arthur Guez
Mehdi Mirza
Karol Gregor
Rishabh Kabra
Sébastien Racanière
Theophane Weber
David Raposo
Adam Santoro
Laurent Orseau
Tom Eccles
Greg Wayne
David Silver
Timothy P. Lillicrap
Published in:
ICML (2019)
Keyphrases
</>
model free
reinforcement learning
function approximation
reinforcement learning algorithms
temporal difference
policy iteration
planning problems
average reward
machine learning
pattern recognition
neural network
monte carlo
policy evaluation