Temporal Difference Models: Model-Free Deep RL for Model-Based Control.

Vitchyr Pong Shixiang Gu Murtaza Dalal Sergey Levine

Published in: CoRR (2018)

Keyphrases

model free
temporal difference
reinforcement learning
function approximation
reinforcement learning algorithms
td learning
policy evaluation
policy iteration
temporal difference learning
temporal difference methods
rl algorithms
machine learning algorithms
control strategies
average reward
actor critic
evaluation function
markov decision processes
monte carlo
machine learning
reinforcement learning methods