Off-Policy Evaluation With Online Adaptation for Robot Exploration in Challenging Environments.
Yafei Hu
Junyi Geng
Chen Wang
John Keller
Sebastian A. Scherer
Published in:
IEEE Robotics Autom. Lett. (2023)
Keyphrases
policy evaluation
least squares
mobile robot
Monte Carlo
temporal difference
model free
policy iteration
reinforcement learning
variance reduction
function approximation
sample size
Markov decision processes
Markov chain
probabilistic model
dynamic programming
active learning
artificial neural networks