Login / Signup
MPC-Net: A First Principles Guided Policy Search.
Jan Carius
Farbod Farshidian
Marco Hutter
Published in:
CoRR (2019)
Keyphrases
</>
policy search
reinforcement learning
continuous action
reinforcement learning algorithms
continuous state
dynamic programming
partially observable markov decision processes
optimal control
reward function
policy gradient