Online Learning in Adversarial MDPs: Is the Communicating Case Harder than Ergodic?

Gautam Chandrasekaran Ambuj Tewari

Published in: CoRR (2021)

Keyphrases

online learning
markov decision processes
online course
reinforcement learning
computer mediated
state space
markov chain
data sets
machine learning
e learning
multi agent
np hard
sufficient conditions
higher education
distance learning
blended learning