A Method for Speeding Up Value Iteration in Partially Observable Markov Decision Processes.
Nevin Lianwen Zhang
Stephen S. Lee
Weihong Zhang
Published in:
UAI (1999)
Keyphrases
dynamic programming
probabilistic model
partially observable Markov decision processes
reinforcement learning
decision making
knowledge base
infinite horizon