Login / Signup
Stochastic Shortest Path: Minimax, Parameter-Free and Towards Horizon-Free Regret.
Jean Tarbouriech
Runlong Zhou
Simon S. Du
Matteo Pirotta
Michal Valko
Alessandro Lazaric
Published in:
NeurIPS (2021)
Keyphrases
</>
parameter free
stochastic shortest path
minimax regret
worst case
categorical data
markov decision processes
lower bound
outlier detection
fully automatic
utility function
reward function
machine learning
pairwise
evaluation function
markov decision problems