Login / Signup
Learning by Repetition: Stochastic Multi-armed Bandits under Priming Effect.
Priyank Agrawal
Theja Tulabandhula
Published in:
CoRR (2020)
Keyphrases
</>
multi armed bandits
learning algorithm
reinforcement learning
learning process
dynamic programming
multi class
probability distribution
online learning