Reinforcement Learning With Imperfect Safety Constraints.
Jin Woo RoGerald LüttgenDiedrich WolterPublished in: SafeAI@AAAI (2022)
Keyphrases
- reinforcement learning
- function approximation
- markov decision processes
- reinforcement learning algorithms
- global constraints
- constraint satisfaction
- robotic control
- temporal difference learning
- constraint programming
- learning process
- learning algorithm
- state space
- real world
- hidden markov models
- constrained optimization
- search engine
- linear constraints
- imperfect information
- genetic algorithm
- machine learning