Login / Signup

Learning a dynamic policy by using policy gradient: application to biped walking.

Takamitsu MatsubaraJun MorimotoJun NakanishiMasa-aki SatoKenji Doya
Published in: Systems and Computers in Japan (2007)
Keyphrases