Order-Optimal Regret in Distributed Kernel Bandits using Uniform Sampling with Shared Randomness.

Nikola Pavlovic, Sudeep Salgia, Qing Zhao
Published in: CoRR (2024)
Keyphrases
  • loss function
  • reinforcement learning
  • shortest path
  • convex optimization