A comparison of learning performance in two-dimensional Q-learning by the difference of Q-values alignment.

Published in: Artif. Life Robotics (2012)

Keyphrases