Login / Signup
Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping.
Richard S. Sutton
Csaba Szepesvári
Alborz Geramifard
Michael H. Bowling
Published in:
UAI (2008)
Keyphrases
</>
function approximation
temporal difference learning algorithms
function approximators
reinforcement learning
reinforcement learning problems
temporal difference learning
learning tasks
temporal difference
radial basis function
model free
planning problems
action selection
basis functions
td learning