Sign in
Contextual multi-armed bandit algorithms for personalized learning action selection.
Indu Manickam
Andrew S. Lan
Richard G. Baraniuk
Published in:
ICASSP (2017)
Keyphrases
</>
action selection
multi armed bandit
learning algorithm
reinforcement learning
decision making
collaborative filtering