Investigating practical, linear temporal difference learning.
Adam M. WhiteMartha WhitePublished in: CoRR (2016)
Keyphrases
- temporal difference learning
- temporal difference learning algorithms
- function approximation
- fixed point
- evaluation function
- game playing
- reinforcement learning
- temporal difference
- function approximators
- approximate value iteration
- reinforcement learning algorithms
- markov decision process
- monte carlo
- search space
- graphical models
- closed form
- machine learning
- multi agent