Login / Signup

A Decentralized Policy with Logarithmic Regret for a Class of Multi-Agent Multi-Armed Bandit Problems with Option Unavailability Constraints and Stochastic Communication Protocols.

Pathmanathan PankayarajD. H. S. MaithripalaJordan M. Berg
Published in: CDC (2020)
Keyphrases