Login / Signup

Stop Regressing: Training Value Functions via Classification for Scalable Deep RL.

Jesse FarebrotherJordi OrbayQuan VuongAdrien Ali TaïgaYevgen ChebotarTed XiaoAlex IrpanSergey LevinePablo Samuel CastroAleksandra FaustAviral KumarRishabh Agarwal
Published in: CoRR (2024)
Keyphrases