Predictor-Corrector(PC) Temporal Difference(TD) Learning (PCTD).
Caleb Bowyer
Published in: CoRR (2021)
Keyphrases
TD learning
temporal difference
evaluation function
function approximation
reinforcement learning
Monte Carlo
model free
reinforcement learning algorithms
policy evaluation
step size
policy iteration
supervised learning
TD methods
neural network
decision making