Login / Signup
PrIC3: Property Directed Reachability for MDPs.
Kevin Batz
Sebastian Junges
Benjamin Lucien Kaminski
Joost-Pieter Katoen
Christoph Matheja
Philipp Schröer
Published in:
CoRR (2020)
Keyphrases
</>
state space
markov decision processes
reinforcement learning
dynamic programming
optimal policy
machine learning
multi agent
desirable properties
initial state
decision making
heuristic search
partially observable
average cost
factored mdps