Improved No-Regret Algorithms for Stochastic Shortest Path with Linear MDP.

Liyu Chen Rahul Jain Haipeng Luo

Published in: CoRR (2021)

Keyphrases

stochastic shortest path
learning algorithm
orders of magnitude
regret minimization
computational complexity
markov decision processes
neural network
machine learning
lower bound
np hard
reward function