Performance of deep reinforcement learning algorithms in two-echelon inventory control systems.

Francesco Stranieri Fabio Stella Chaaben Kouki

Published in: Int. J. Prod. Res. (2024)

Keyphrases

reinforcement learning algorithms
control system
reinforcement learning
state space
model free
markov decision processes
reinforcement learning problems
eligibility traces
reinforcement learning methods
temporal difference
learning algorithm
function approximation
reward function
partially observable environments
stochastic games
policy search
reward shaping
policy gradient
tabula rasa
optimal policy
dynamic environments
hidden markov models
search space
multi agent
data mining