Abstract Value Iteration for Hierarchical Reinforcement Learning.
Kishor Jothimurugan
Osbert Bastani
Rajeev Alur
Published in: CoRR (2020)
Keyphrases
hierarchical reinforcement learning
average reward
Markov decision processes
state abstraction
Markov decision process
state space
reinforcement learning
policy iteration
model free
reward function
optimal policy
heuristic search
long run
infinite horizon
dynamic programming
initial state
dynamic environments