Login / Signup
L*-Based Learning of Markov Decision Processes (Extended Version).
Martin Tappler
Bernhard K. Aichernig
Giovanni Bacci
Maria Eichlseder
Kim G. Larsen
Published in:
CoRR (2019)
Keyphrases
</>
markov decision processes
reinforcement learning
partially observable
state space
optimal policy
learning tasks
state abstraction
finite state
learning algorithm
model based reinforcement learning
linear program
policy iteration
finite horizon
actor critic
macro actions
action sets
real time dynamic programming