Offline Stochastic Shortest Path: Learning, Evaluation and Towards Optimality.

Ming Yin, Wenjing Chen, Mengdi Wang, Yu-Xiang Wang
Published in: CoRR (2022)

Keyphrases
  • reinforcement learning
  • learning process
  • lower bound
  • active learning
  • supervised learning
  • markov chain
  • learning tasks