C
search
search
reviewers
reviewers
feeds
feeds
assignments
assignments
settings
logout
Robust Offline Reinforcement Learning with Gradient Penalty and Constraint Relaxation.
Chengqian Gao
Ke Xu
Liu Liu
Deheng Ye
Peilin Zhao
Zhiqiang Xu
Published in:
CoRR (2022)
Keyphrases
</>
reinforcement learning
constraint relaxation
constraint satisfaction
multi agent
edge detection
function approximation
objective function
simulated annealing
learning algorithm
transfer learning
markov decision processes
model free