Posterior Sampling for Competitive RL: Function Approximation and Partial Observation.
Shuang QiuZiyu DaiHan ZhongZhaoran WangZhuoran YangTong ZhangPublished in: NeurIPS (2023)
Keyphrases
- function approximation
- reinforcement learning
- tile coding
- model free
- temporal difference
- temporal difference learning algorithms
- temporal difference learning
- learning tasks
- reinforcement learning algorithms
- radial basis function
- function approximators
- probability distribution
- state space
- temporal difference methods
- td learning
- machine learning
- real valued
- feature extraction
- decision trees
- reinforcement learning problems
- genetic algorithm