Login / Signup
Improved Strongly Polynomial Algorithms for Deterministic MDPs, 2VPI Feasibility, and Discounted All-Pairs Shortest Paths.
Adam Karczmarz
Published in:
CoRR (2021)
Keyphrases
</>
markov decision processes
reinforcement learning
learning algorithm
minimum cost flow
optimization problems
factored mdps
strongly polynomial
shortest path
optimal policy