Login / Signup
Guided Meta-Policy Search.
Russell Mendonca
Abhishek Gupta
Rosen Kralev
Pieter Abbeel
Sergey Levine
Chelsea Finn
Published in:
NeurIPS (2019)
Keyphrases
</>
policy search
reinforcement learning
dynamic programming
continuous state
reinforcement learning algorithms
continuous action
reward function
policy gradient
neural network
real valued
partially observable markov decision processes
multi agent