Partially observable Markov decision processes with reward information.

Xi-Ren Cao, Xianping Guo
Published in: CDC (2004)
Keyphrases
  • machine learning
  • dynamic programming
  • Markov decision processes
  • average reward