Guided Meta-Policy Search.
Russell Mendonca
Abhishek Gupta
Rosen Kralev
Pieter Abbeel
Sergey Levine
Chelsea Finn
Published in:
CoRR (2019)
Keyphrases
policy search
reinforcement learning
reinforcement learning algorithms
continuous state
continuous action
dynamic programming
partially observable Markov decision processes
policy gradient
Bayesian networks
Monte Carlo
reward function