Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path.

Qiwei Di Jiafan He Dongruo Zhou Quanquan Gu

Published in: CoRR (2024)

Keyphrases

online learning
worst case
learning process
bayesian networks
reinforcement learning
learning tasks