Introduction of Fixed Mode States into Online Reinforcement Learning with Penalties and Rewards and its Application to Biped Robot Waist Trajectory Generation.

Seiya Kuroda, Kazuteru Miyazaki, Hiroaki Kobayashi
Published in: J. Adv. Comput. Intell. Intell. Informatics (2012)