Order-Optimal Regret in Distributed Kernel Bandits using Uniform Sampling with Shared Randomness.
Nikola Pavlovic
Sudeep Salgia
Qing Zhao
Published in:
CoRR (2024)
Keyphrases
loss function
reinforcement learning
shortest path
convex optimization