Adaptive Tree Backup Algorithms for Temporal-Difference Reinforcement Learning.
Brett DaleyIsaac ChanPublished in: CoRR (2022)
Keyphrases
- temporal difference
- reinforcement learning
- model free
- td learning
- function approximation
- function approximators
- reinforcement learning algorithms
- policy iteration
- evaluation function
- learning algorithm
- temporal difference learning
- policy evaluation
- actor critic
- inverse reinforcement learning
- data mining
- reinforcement learning methods
- dynamic programming
- state space
- decision making