Provably Safe Reinforcement Learning with Step-wise Violation Constraints.

Nuoya Xiong, Yihan Du, Longbo Huang

Published in: NeurIPS (2023)

Keyphrases

step-wise
reinforcement learning
function approximation
machine learning
relational and xml data
constraint violations
learning process
constrained optimization
reinforcement learning algorithms
multi-agent
supervised learning
learning problems
constraint networks
mixed integer
function approximators
temporal difference learning
multi-agent reinforcement learning
case study
real time