Accelerated and Instance-Optimal Policy Evaluation with Linear Function Approximation.
Tianjiao LiGuanghui LanAshwin PananjadyPublished in: SIAM J. Math. Data Sci. (2023)
Keyphrases
- function approximation
- policy evaluation
- temporal difference
- reinforcement learning
- td learning
- function approximators
- model free
- least squares
- learning tasks
- radial basis function
- monte carlo
- optimal control
- dynamic programming
- optimal solution
- markov decision processes
- reinforcement learning algorithms
- policy iteration
- evaluation function
- linear programming
- semi parametric
- search space
- data mining