Permissive Supervisor Synthesis for Markov Decision Processes Through Learning.
Bo WuXiaobin ZhangHai LinPublished in: IEEE Trans. Autom. Control. (2019)
Keyphrases
- markov decision processes
- reinforcement learning
- learning algorithm
- state space
- model based reinforcement learning
- optimal policy
- stochastic games
- partially observable
- transition matrices
- state abstraction
- reinforcement learning algorithms
- learning tasks
- finite state
- finite horizon
- action space
- multistage
- decision theoretic planning
- supervised learning
- multi agent