Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias.

Max Sobol Mark, Archit Sharma, Fahim Tajwar, Rafael Rafailov, Sergey Levine, Chelsea Finn
Published in: CoRR (2023)
Keyphrases