Concurrent Learning of Policy and Unknown Safety Constraints in Reinforcement Learning.
Lunet YifruAli BaheriPublished in: CoRR (2024)
Keyphrases
- reinforcement learning
- learning algorithm
- learning process
- policy search
- optimal policy
- knowledge acquisition
- markov decision processes
- learning problems
- evolutionary learning
- learning systems
- autonomous learning
- reinforcement learning methods
- partially observable environments
- actor critic
- mutual exclusion
- learning agents
- action selection
- online learning
- state space
- learning environment
- bayesian networks