Login / Signup
A Multi-armed Bandit Algorithm Available in Stationary or Non-stationary Environments Using Self-organizing Maps.
Nobuhito Manome
Shuji Shinohara
Kouta Suzuki
Kosuke Tomonaga
Shunji Mitsuyoshi
Published in:
ICANN (1) (2019)
Keyphrases
</>
learning algorithm
k means
optimal solution
computational complexity
expectation maximization
multi armed bandit
special case
worst case
online learning
active learning
probabilistic model
non stationary
distance metric