Robust Offline Reinforcement Learning with Gradient Penalty and Constraint Relaxation.

Published in: CoRR (2022)
