• search
    search
  • reviewers
    reviewers
  • feeds
    feeds
  • assignments
    assignments
  • settings
  • logout

Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path.

Qiwei DiJiafan HeDongruo ZhouQuanquan Gu
Published in: CoRR (2024)
Keyphrases
  • online learning
  • worst case
  • learning process
  • bayesian networks
  • reinforcement learning
  • learning tasks