Learning Mixtures of Markov Chains and MDPs.
Chinmaya Kausik
Kevin Tan
Ambuj Tewari
Published in: ICML (2023)
Keyphrases
markov chain
state space
reinforcement learning
markov decision processes
transition probabilities
learning algorithm
dynamic programming
steady state
finite state
objective function
higher order
non stationary
markov process