Login / Signup
Improved No-Regret Algorithms for Stochastic Shortest Path with Linear MDP.
Liyu Chen
Rahul Jain
Haipeng Luo
Published in:
CoRR (2021)
Keyphrases
</>
stochastic shortest path
learning algorithm
orders of magnitude
regret minimization
computational complexity
markov decision processes
neural network
machine learning
lower bound
np hard
reward function