Login / Signup
Sample Efficient Actor-Critic with Experience Replay.
Ziyu Wang
Victor Bapst
Nicolas Heess
Volodymyr Mnih
Rémi Munos
Koray Kavukcuoglu
Nando de Freitas
Published in:
ICLR (Poster) (2017)
Keyphrases
</>
machine learning
reinforcement learning
sample size
actor critic
learning tasks
function approximation
temporal difference