Login / Signup
Learning Mixtures of Markov Chains and MDPs.
Chinmaya Kausik
Kevin Tan
Ambuj Tewari
Published in:
ICML (2023)
Keyphrases
</>
markov chain
state space
reinforcement learning
markov decision processes
transition probabilities
learning algorithm
dynamic programming
steady state
finite state
objective function
higher order
non stationary
markov process