Login / Signup
Regret guarantees for model-based reinforcement learning with long-term average constraints.
Mridul Agarwal
Qinbo Bai
Vaneet Aggarwal
Published in:
UAI (2022)
Keyphrases
</>
long term
model based reinforcement learning
markov decision processes
constraint satisfaction
reinforcement learning