Login / Signup
A theoretical and empirical analysis of Expected Sarsa.
Harm van Seijen
Hado van Hasselt
Shimon Whiteson
Marco A. Wiering
Published in:
ADPRL (2009)
Keyphrases
</>
reinforcement learning
data sets
database
image sequences
training data
support vector
expert systems
search space
temporal difference