Login / Signup
Algorithm for Constrained Markov Decision Process with Linear Convergence.
Egor Gladin
Maksim Lavrik-Karmazin
Karina Zainullina
Varvara Rudenko
Alexander V. Gasnikov
Martin Takác
Published in:
AISTATS (2023)
Keyphrases
</>
iterative algorithms
learning algorithm
dynamic programming
computational complexity
k means
markov decision process
reinforcement learning
objective function
optimal solution
expectation maximization
probability distribution
supply chain
convergence rate
initial state