
Pessimistic Nonlinear Least-Squares Value Iteration for Offline Reinforcement Learning.

Qiwei Di, Heyang Zhao, Jiafan He, Quanquan Gu
Published in: CoRR (2023)
Keyphrases