Policy Optimization with Smooth Guidance Rewards Learned from Sparse-Reward Demonstrations.

Guojian Wang, Faguo Wu, Xiao Zhang, Tianyuan Chen
Published in: CoRR (2024)
Keyphrases