On Generalized Bellman Equations and Temporal-Difference Learning.
Huizhen YuAshique Rupam MahmoodRichard S. SuttonPublished in: Canadian Conference on AI (2017)
Keyphrases
- temporal difference learning
- function approximation
- fixed point
- reinforcement learning
- game playing
- evaluation function
- approximate value iteration
- reinforcement learning algorithms
- temporal difference
- markov decision process
- monte carlo
- video games
- regression model
- collaborative learning
- support vector machine
- search space
- multi agent