Login / Signup

REBEL: Reinforcement Learning via Regressing Relative Rewards.

Zhaolin GaoJonathan D. ChangWenhao ZhanOwen OertellGokul SwamyKianté BrantleyThorsten JoachimsJ. Andrew BagnellJason D. LeeWen Sun
Published in: CoRR (2024)
Keyphrases