Login / Signup
Source Traces for Temporal Difference Learning.
Silviu Pitis
Published in:
CoRR (2019)
Keyphrases
</>
temporal difference learning
function approximation
fixed point
evaluation function
game playing
approximate value iteration
reinforcement learning
temporal difference
reinforcement learning algorithms
markov decision process
neural network
state space
linear programming