Performance of deep reinforcement learning algorithms in two-echelon inventory control systems.
Francesco StranieriFabio StellaChaaben KoukiPublished in: Int. J. Prod. Res. (2024)
Keyphrases
- reinforcement learning algorithms
- control system
- reinforcement learning
- state space
- model free
- markov decision processes
- reinforcement learning problems
- eligibility traces
- reinforcement learning methods
- temporal difference
- learning algorithm
- function approximation
- reward function
- partially observable environments
- stochastic games
- policy search
- reward shaping
- policy gradient
- tabula rasa
- optimal policy
- dynamic environments
- hidden markov models
- search space
- multi agent
- data mining