Sign in

Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning.

Fan-Ming LuoTian XuXingchen CaoYang Yu
Published in: CoRR (2023)
Keyphrases