Sign in

CROP: Conservative Reward for Model-based Offline Policy Optimization.

Hao LiXiao-Hu ZhouXiao-Liang XieShi-Qi LiuZhen-Qiu FengXiao-Yin LiuMei-Jiang GuiTian-Yu XiangDe-Xing HuangBo-Xian YaoZeng-Guang Hou
Published in: CoRR (2023)
Keyphrases