Reward-Respecting Subtasks for Model-Based Reinforcement Learning.
Richard S. Sutton
Marlos C. Machado
G. Zacharias Holland
David Szepesvari
Finbarr Timbers
Brian Tanner
Adam White
Published in:
CoRR (2022)
Keyphrases
model based reinforcement learning
reinforcement learning
Markov decision processes
reward function
state space
function approximation
optimal policy
random walk
reinforcement learning algorithms
machine learning
finite state
long run
model free
Markov decision problems