Reinforcement Learning from Partial Observation: Linear Function Approximation with Provable Sample Efficiency.
Qi Cai, Zhuoran Yang, Zhaoran Wang
Published in: ICML (2022)
Keyphrases
- function approximation
- reinforcement learning
- temporal difference learning algorithms
- function approximators
- temporal difference learning
- mountain car
- temporal difference
- radial basis function
- state action space
- learning tasks
- tile coding
- model free
- td learning
- reinforcement learning algorithms
- state space
- dynamic programming
- neural network
- small number
- multi agent
- machine learning
- real valued
- optimal control
- action selection
- support vector machine
- policy evaluation
- reinforcement learning problems
- temporal difference methods
- learning algorithm