Reachability Constrained Reinforcement Learning.
Dongjie Yu, Haitong Ma, Shengbo Eben Li, Jianyu Chen
Published in: CoRR (2022)
Keyphrases
- reinforcement learning
- state space
- function approximation
- machine learning
- optimal policy
- direct policy search
- model free
- markov decision processes
- reinforcement learning methods
- real time
- reinforcement learning algorithms
- query language
- policy search
- markov chain
- multi agent
- transitive closure
- data sets
- lagrange multipliers
- learning agent
- robot control
- partially observable
- learning algorithm
- bayesian networks
- dynamic environments
- supervised learning
- dynamic programming