Login / Signup
Performance in Multi-Armed Bandit Tasks in Relation to Ambiguity-Preference Within a Learning Algorithm.
Song-Ju Kim
Taiki Takahashi
Published in:
Frontiers Appl. Math. Stat. (2018)
Keyphrases
</>
learning algorithm
multi armed bandit
training data
multi armed bandits
machine learning
supervised learning
reinforcement learning
preference relations
data points