A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning.

Yinmin Zhang Jie Liu Chuming Li Yazhe Niu Yaodong Yang Yu Liu Wanli Ouyang

Published in: CoRR (2023)

Keyphrases

reinforcement learning
real time
online learning
least squares
robotic control
multi agent
machine learning
function approximation
estimation algorithm
maximum likelihood estimation
model free
estimation accuracy
cross cultural
learning algorithm
genetic algorithm
dynamic programming
state space
real world
density estimation
online environment
data sets