Adaptive Tree Backup Algorithms for Temporal-Difference Reinforcement Learning.

Brett Daley, Isaac Chan

Published in: CoRR (2022)

Keyphrases

temporal difference
reinforcement learning
model free
TD learning
function approximation
function approximators
reinforcement learning algorithms
policy iteration
evaluation function
learning algorithm
temporal difference learning
policy evaluation
actor critic
inverse reinforcement learning
data mining
reinforcement learning methods
dynamic programming
state space
decision making