C
search
search
reviewers
reviewers
feeds
feeds
assignments
assignments
settings
logout
Improved No-Regret Algorithms for Stochastic Shortest Path with Linear MDP.
Liyu Chen
Rahul Jain
Haipeng Luo
Published in:
CoRR (2021)
Keyphrases
</>
stochastic shortest path
learning algorithm
orders of magnitude
regret minimization
computational complexity
markov decision processes
neural network
machine learning
lower bound
np hard
reward function