Introduction of Fixed Mode States into Online Reinforcement Learning with Penalties and Rewards and its Application to Biped Robot Waist Trajectory Generation.

Published in: J. Adv. Comput. Intell. Intell. Informatics (2012)

Keyphrases