Login / Signup

A Convex Programming Approach for Discrete-Time Markov Decision Processes under the Expected Total Reward Criterion.

François DufourAlexandre Genadot
Published in: SIAM J. Control. Optim. (2020)
Keyphrases