Learning with Safety Constraints: Sample Complexity of Reinforcement Learning for Constrained MDPs.
Aria HasanzadeZonuzyDileep M. KalathilSrinivas ShakkottaiPublished in: CoRR (2020)
Keyphrases
- reinforcement learning
- learning problems
- sample complexity
- learning algorithm
- supervised learning
- learning process
- markov decision processes
- learning tasks
- sample complexity bounds
- partially observable
- active learning
- state space
- data sets
- unsupervised learning
- dynamic programming
- vc dimension
- machine learning
- continuous state
- decision lists
- linear threshold
- policy search
- pac learnability