On optimal foraging and multi-armed bandits.
Vaibhav Srivastava
Paul Reverdy
Naomi Ehrich Leonard
Published in:
Allerton (2013)
Keyphrases
multi-armed bandits
dynamic programming
bandit problems
least squares
closed form
swarm intelligence
multi-armed bandit
reinforcement learning
optimal solution
multi-objective