Learning an Optimal Control Policy for a Markov Decision Process Under Linear Temporal Logic Specifications.
Masaki Hiromoto
Toshimitsu Ushio
Published in:
SSCI (2015)
Keyphrases
control policy
reinforcement learning
learning algorithm
linear temporal logic
high level
search algorithm
dynamic programming
general purpose
approximate dynamic programming
long run
control policies