Markov Decision Process Framework for Control-Based Reinforcement Learning.
Yingdong LuMark S. SquillanteChai Wah WuPublished in: SIGMETRICS Perform. Evaluation Rev. (2023)
Keyphrases
- markov decision process
- reinforcement learning
- state space
- optimal policy
- markov decision processes
- control system
- control problems
- temporal difference learning
- optimal control
- markov games
- np hard
- search algorithm
- semi supervised
- machine learning
- optimal solution
- infinite horizon
- model free
- objective function
- partial observability