Login / Signup

A learning algorithm for the finite-time two-armed bandit problem.

Mitsuo SatoKenichi AbeHiroshi Takeda
Published in: IEEE Trans. Syst. Man Cybern. (1984)
Keyphrases