Robust Offline Reinforcement Learning with Gradient Penalty and Constraint Relaxation.
Chengqian Gao
Ke Xu
Liu Liu
Deheng Ye
Peilin Zhao
Zhiqiang Xu
Published in:
CoRR (2022)
Keyphrases
reinforcement learning
constraint relaxation
constraint satisfaction
multi-agent
edge detection
function approximation
objective function
simulated annealing
learning algorithm
transfer learning
Markov decision processes
model-free