First-Order Regret in Reinforcement Learning with Linear Function Approximation: A Robust Estimation Approach.
Andrew J. Wagenmaker, Yifang Chen, Max Simchowitz, Simon S. Du, Kevin G. Jamieson
Published in: ICML (2022)
Keyphrases
- function approximation
- robust estimation
- reinforcement learning
- function approximators
- temporal difference learning algorithms
- least squares
- temporal difference learning
- temporal difference
- model free
- learning tasks
- radial basis function
- reinforcement learning algorithms
- reward function
- motion field
- state space
- learning algorithm
- lower bound
- kernel methods
- loss function
- weight vector
- learning process
- feature space
- policy evaluation
- actor critic
- multimedia