Online Learning in Adversarial MDPs: Is the Communicating Case Harder than Ergodic?
Gautam Chandrasekaran
Ambuj Tewari
Published in: CoRR (2021)
Keyphrases
online learning
markov decision processes
online course
reinforcement learning
computer mediated
state space
markov chain
data sets
machine learning
e learning
multi agent
np hard
sufficient conditions
higher education
distance learning
blended learning