Exploiting the Structural Properties of the Underlying Markov Decision Problem in the Q-Learning Algorithm.

Published in: INFORMS J. Comput. (2008)

Keyphrases