A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning.
Yinmin Zhang, Jie Liu, Chuming Li, Yazhe Niu, Yaodong Yang, Yu Liu, Wanli Ouyang
Published in: CoRR (2023)
Keyphrases
- reinforcement learning
- real time
- online learning
- least squares
- robotic control
- multi-agent
- machine learning
- function approximation
- estimation algorithm
- maximum likelihood estimation
- model-free
- estimation accuracy
- cross cultural
- learning algorithm
- genetic algorithm
- dynamic programming
- state space
- real world
- density estimation
- online environment
- data sets