Login / Signup
Oracle-Efficient Regret Minimization in Factored MDPs with Unknown Structure.
Aviv Rosenberg
Yishay Mansour
Published in:
NeurIPS (2021)
Keyphrases
</>
factored mdps
regret minimization
lower bound
state space
markov decision processes