Login / Signup

Performance in Multi-Armed Bandit Tasks in Relation to Ambiguity-Preference Within a Learning Algorithm.

Song-Ju KimTaiki Takahashi
Published in: Frontiers Appl. Math. Stat. (2018)
Keyphrases
  • learning algorithm
  • multi armed bandit
  • training data
  • multi armed bandits
  • machine learning
  • supervised learning
  • reinforcement learning
  • preference relations
  • data points