Source Traces for Temporal Difference Learning.
Silviu Pitis
Published in:
AAAI (2018)
Keyphrases
temporal difference learning
function approximation
fixed point
game playing
evaluation function
reinforcement learning
approximate value iteration
temporal difference
markov decision process
reinforcement learning algorithms
monte carlo
active learning
markov decision processes