Optimal policy learning for COVID-19 prevention using reinforcement learning.
Muhammad Irfan UddinSyed Atif Ali ShahMahmoud Ahmad Al-KhasawnehAla Abdulsalam AlaroodEesa AlsolamiPublished in: J. Inf. Sci. (2022)
Keyphrases
- reinforcement learning
- optimal policy
- markov decision processes
- learning algorithm
- average reward reinforcement learning
- decision problems
- state space
- function approximation
- dynamic programming
- markov decision process
- machine learning
- bayesian reinforcement learning
- model free
- temporal difference
- infinite horizon
- state dependent
- temporal difference learning
- actor critic
- long run
- reward function
- reinforcement learning algorithms
- partially observable
- partially observable markov decision processes
- markov decision problems
- multistage