Optimal Policy Learning for Disease Prevention Using Reinforcement Learning.
Zahid Alam KhanZhengyong FengMuhammad Irfan UddinNoor MastSyed Atif Ali ShahMuhammad ImtiazMahmoud Ahmad Al-KhasawnehMarwan MahmoudPublished in: Sci. Program. (2020)
Keyphrases
- reinforcement learning
- optimal policy
- markov decision processes
- state space
- learning algorithm
- average reward reinforcement learning
- long run
- infinite horizon
- markov decision process
- finite horizon
- temporal difference learning
- decision problems
- function approximation
- temporal difference
- bayesian reinforcement learning
- multistage
- model free
- state dependent
- function approximators
- average reward
- reinforcement learning methods
- reinforcement learning algorithms
- cost function