Offline Evaluation of Online Reinforcement Learning Algorithms.
Travis Mandel
Yun-En Liu
Emma Brunskill
Zoran Popovic
Published in:
AAAI (2016)
Keyphrases
reinforcement learning algorithms
reinforcement learning
state space
markov decision processes
eligibility traces
model free
function approximation
temporal difference
reinforcement learning problems
partially observable environments
learning algorithm
stochastic games
least squares
monte carlo