Convexity and Feedback in Approximate Dynamic Programming for Delivery Time Slot Pricing.
Denis Lebedev
Kostas Margellos
Paul Goulart
Published in:
IEEE Trans. Control. Syst. Technol. (2022)
Keyphrases
approximate dynamic programming
linear program
dynamic programming
stochastic dynamic programming
reinforcement learning
step size
factored MDPs
policy iteration
linear programming
average cost
learning algorithm
least squares