Login / Signup
Acquiring of walking behavior for four-legged robots using actor-critic method based on policy gradient.
Ryo Inoue
Kota Watanabe
Hajime Igarashi
Published in:
ISIC (2010)
Keyphrases
</>
policy gradient
cost function
dynamic programming
reinforcement learning
support vector machine
support vector machine svm