Provably Safe Reinforcement Learning with Step-wise Violation Constraints.
Nuoya XiongYihan DuLongbo HuangPublished in: NeurIPS (2023)
Keyphrases
- step wise
- reinforcement learning
- function approximation
- machine learning
- relational and xml data
- constraint violations
- learning process
- constrained optimization
- reinforcement learning algorithms
- multi agent
- supervised learning
- learning problems
- constraint networks
- mixed integer
- function approximators
- temporal difference learning
- multi agent reinforcement learning
- case study
- real time