Robust Average-Reward Markov Decision Processes.
Yue WangAlvaro VelasquezGeorge K. AtiaAshley Prater-BennetteShaofeng ZouPublished in: CoRR (2023)
Keyphrases
- reward function
- markov decision processes
- average reward
- optimal policy
- reinforcement learning algorithms
- reinforcement learning
- state space
- discounted reward
- stochastic games
- policy iteration
- semi markov decision processes
- finite state
- dynamic programming
- optimality criterion
- hierarchical reinforcement learning
- state and action spaces
- markov decision process
- average cost
- total reward
- planning under uncertainty
- factored mdps
- infinite horizon
- decision theoretic planning
- model free
- long run