Login / Signup

Exploiting the Structural Properties of the Underlying Markov Decision Problem in the Q-Learning Algorithm.

Sumit KunnumkalHuseyin Topaloglu
Published in: INFORMS J. Comput. (2008)
Keyphrases