COVID-19 Pandemic Cyclic Lockdown Optimization Using Reinforcement Learning.
Mauricio ArangoLyudmil PelovPublished in: CoRR (2020)
Keyphrases
- reward function
- reinforcement learning
- markov decision processes
- state space
- markov decision process
- function approximation
- transition model
- optimization problems
- discrete optimization
- optimization method
- optimization algorithm
- global optimization
- learning algorithm
- public health
- neural network
- action space
- model free
- evolution strategy
- optimization model
- optimization process
- data mining
- information systems
- database systems
- optimization strategies
- multi agent reinforcement learning
- multi agent