Convergent Fitted Value Iteration with Linear Function Approximation.
Daniel J. Lizotte
Published in: NIPS (2011)
Keyphrases
- function approximation
- temporal difference learning algorithms
- reinforcement learning
- function approximators
- model free
- temporal difference learning
- radial basis function
- Markov decision processes
- learning tasks
- temporal difference
- optimal policy
- policy iteration
- state space
- temporal difference methods
- TD learning
- fixed point
- reinforcement learning algorithms
- Markov decision process
- linear combination
- dynamic programming