Convergence Results for Some Temporal Difference Methods Based on Least Squares.
Huizhen Yu, Dimitri P. Bertsekas
Published in: IEEE Trans. Autom. Control (2009)
Keyphrases
- least squares
- temporal difference methods
- policy evaluation
- function approximation
- temporal difference
- optical flow
- evolutionary methods
- Monte Carlo
- convergence rate
- policy iteration
- convergence speed
- policy search
- function approximators
- reinforcement learning
- genetic algorithm
- dynamical systems
- evolutionary algorithm
- TD learning