A Statistical Analysis of Polyak-Ruppert Averaged Q-Learning.
Xiang LiWenhao YangJiadong LiangZhihua ZhangMichael I. JordanPublished in: AISTATS (2023)
Keyphrases
- statistical analysis
- reinforcement learning
- function approximation
- multi agent
- learning algorithm
- stochastic approximation
- cooperative
- state space
- statistical methods
- multi agent reinforcement learning
- optimal policy
- reinforcement learning algorithms
- model free
- statistical analyses
- action selection
- learning rate
- dynamic programming
- policy iteration
- hierarchical reinforcement learning
- neural network
- clinical data
- database
- multiagent learning
- relational reinforcement learning
- credit assignment