Stochastic dynamic programming heuristic for the (R,s,S) policy parameters computation.
Andrea Visentin
Steven D. Prestwich
Roberto Rossi
S. Armagan Tarim
Published in:
Comput. Oper. Res. (2023)
Keyphrases
convergence rate
approximate dynamic programming
step size
stochastic dynamic programming
dynamic programming
search algorithm
lower bound
reinforcement learning
computational complexity
partially observable
control policy