Sign in

Adaptive Baseline Enhances EM-Based Policy Search: Validation in a View-Based Positioning Task of a Smartphone Balancer.

Jiexin WangEiji UchibeKenji Doya
Published in: Frontiers Neurorobotics (2017)
Keyphrases
  • policy search
  • reinforcement learning
  • continuous state
  • reinforcement learning algorithms
  • expectation maximization
  • multi agent
  • mobile robot