Corruption-Robust Offline Reinforcement Learning with General Function Approximation.
Chenlu YeRui YangQuanquan GuTong ZhangPublished in: NeurIPS (2023)
Keyphrases
- function approximation
- reinforcement learning
- temporal difference
- temporal difference learning algorithms
- temporal difference learning
- model free
- tile coding
- learning tasks
- mountain car
- function approximators
- state space
- radial basis function
- reinforcement learning algorithms
- temporal difference methods
- td learning
- markov decision process
- state action space