C
search
search
reviewers
reviewers
feeds
feeds
assignments
assignments
settings
logout
Efficient Planning in Large POMDPs through Policy Graph Based Factorized Approximations.
Joni Pajarinen
Jaakko Peltonen
Ari Hottinen
Mikko A. Uusitalo
Published in:
ECML/PKDD (3) (2010)
Keyphrases
</>
partially observable markov decision processes
partially observable
reinforcement learning
belief state
markov decision problems
optimal policy
stochastic domains
state space
heuristic search
action selection
policy gradient
point based value iteration