Distributionally Robust Off-Dynamics Reinforcement Learning: Provable Efficiency with Linear Function Approximation.
Zhishuai LiuPan XuPublished in: CoRR (2024)
Keyphrases
- function approximation
- reinforcement learning
- temporal difference learning algorithms
- function approximators
- temporal difference
- temporal difference learning
- state action space
- radial basis function
- mountain car
- robust optimization
- tile coding
- learning tasks
- model free
- policy evaluation
- temporal difference methods
- state space
- policy gradient
- reinforcement learning algorithms
- td learning
- optimal policy
- dynamic programming