Offline Actor-Critic Reinforcement Learning Scales to Large Models.
Jost Tobias SpringenbergAbbas AbdolmalekiJingwei ZhangOliver GrothMichael BloeschThomas LampePhilemon BrakelSarah BechtleSteven KapturowskiRoland HafnerNicolas HeessMartin A. RiedmillerPublished in: CoRR (2024)