A learning based approach to control synthesis of Markov decision processes for linear temporal logic specifications.
Dorsa Sadigh
Eric S. Kim
Samuel Coogan
S. Shankar Sastry
Sanjit A. Seshia
Published in:
CDC (2014)
Keyphrases
markov decision processes
reinforcement learning
learning algorithm
state space
partially observable
finite state
heuristic search
control strategy
policy iteration
model based reinforcement learning
real time dynamic programming
optimal policy
linear temporal logic
transition matrices