Login / Signup
Long-Term Exploration in Persistent MDPs.
Leonid Ugadiarov
Alexey Skrynnik
Aleksandr I. Panov
Published in:
CoRR (2021)
Keyphrases
</>
long term
markov decision processes
short term
model based reinforcement learning
reinforcement learning
state space
factored mdps
learning algorithm
probability distribution
markov decision process
markov decision problems
sufficient conditions
initial state
finite horizon
decision theoretic planning