Learning an Optimal Control Policy for a Markov Decision Process Under Linear Temporal Logic Specifications.

Masaki Hiromoto, Toshimitsu Ushio
Published in: SSCI (2015)
Keyphrases
  • control policy
  • reinforcement learning
  • learning algorithm
  • linear temporal logic
  • high level
  • search algorithm
  • dynamic programming
  • general purpose
  • approximate dynamic programming
  • long run
  • control policies