Performance Investigation of UCB Policy in Q-learning.

Koki Saito, Akira Notsu, Seiki Ubukata, Katsuhiro Honda

Published in: ICMLA (2015)

Keyphrases