Partially observable Markov decision processes with reward information.
Xi-Ren Cao
Xianping Guo
Published in:
CDC (2004)
Keyphrases
machine learning
dynamic programming
Markov decision processes
average reward