A Summary of Online Markov Decision Processes with Non-oblivious Strategic Adversary.
Le Cong Dinh, David Henry Mguni, Long Tran-Thanh, Jun Wang, Yaodong Yang
Published in: AAMAS (2024)
Keyphrases
- markov decision processes
- state space
- optimal policy
- policy iteration
- transition matrices
- reinforcement learning
- finite state
- dynamic programming
- factored mdps
- reachability analysis
- infinite horizon
- partially observable
- decision theoretic planning
- model based reinforcement learning
- average cost
- finite horizon
- average reward
- planning under uncertainty
- markov decision process
- decision making
- action space
- risk sensitive
- state abstraction
- total reward
- reinforcement learning algorithms
- reward function
- continuous state spaces