Trajectory-based Probabilistic Policy Gradient for Learning Locomotion Behaviors.
Sungjoon Choi
Joohyung Kim
Published in:
ICRA (2019)
Keyphrases
reinforcement learning
policy gradient
learning algorithm
learning process
mobile robot
learning problems
bayesian networks
supervised learning
generative model