Efficient Policy Construction for MDPs Represented in Probabilistic PDDL.
Boris Lesner
Bruno Zanuttini
Published in:
ICAPS (2011)
Keyphrases
optimal policy
markov decision processes
reinforcement learning
probabilistic model
infinite horizon
markov decision process
finite horizon
markov decision problems
utility function
average cost
planning systems
average reward
probabilistic planning
policy search