Sign in
Natural actor-critic with baseline adjustment for variance reduction.
Tetsuro Morimura
Eiji Uchibe
Kenji Doya
Published in:
Artif. Life Robotics (2008)
Keyphrases
</>
variance reduction
natural actor critic
robot arm
sample size
monte carlo
importance sampling
reinforcement learning problems
function approximation
confidence intervals
naive bayes classifier
training data
reinforcement learning
position and orientation
reinforcement learning methods