Login / Signup
Efficient Planning in Large POMDPs through Policy Graph Based Factorized Approximations.
Joni Pajarinen
Jaakko Peltonen
Ari Hottinen
Mikko A. Uusitalo
Published in:
ECML/PKDD (3) (2010)
Keyphrases
</>
partially observable markov decision processes
partially observable
reinforcement learning
belief state
markov decision problems
optimal policy
stochastic domains
state space
heuristic search
action selection
policy gradient
point based value iteration