Login / Signup

On optimal foraging and multi-armed bandits.

Vaibhav SrivastavaPaul ReverdyNaomi Ehrich Leonard
Published in: Allerton (2013)
Keyphrases
  • multi armed bandits
  • dynamic programming
  • bandit problems
  • least squares
  • closed form
  • swarm intelligence
  • multi armed bandit
  • reinforcement learning
  • optimal solution
  • multi objective