Login / Signup
Adaptive Baseline Enhances EM-Based Policy Search: Validation in a View-Based Positioning Task of a Smartphone Balancer.
Jiexin Wang
Eiji Uchibe
Kenji Doya
Published in:
Frontiers Neurorobotics (2017)
Keyphrases
</>
policy search
reinforcement learning
continuous state
reinforcement learning algorithms
expectation maximization
multi agent
mobile robot