Discounted UCB1-tuned for Q-learning.

Koki Saito Akira Notsu Katsuhiro Honda

Published in: SCIS&ISIS (2014)

Keyphrases