Offline Evaluation of Online Reinforcement Learning Algorithms.

Travis Mandel Yun-En Liu Emma Brunskill Zoran Popovic

Published in: AAAI (2016)

Keyphrases

reinforcement learning algorithms
reinforcement learning
state space
markov decision processes
eligibility traces
model free
function approximation
temporal difference
reinforcement learning problems
partially observable environments
learning algorithm
stochastic games
least squares
monte carlo