Login / Signup
Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path.
Qiwei Di
Jiafan He
Dongruo Zhou
Quanquan Gu
Published in:
CoRR (2024)
Keyphrases
</>
online learning
worst case
learning process
bayesian networks
reinforcement learning
learning tasks