On the Robustness of Epoch-Greedy in Multi-Agent Contextual Bandit Mechanisms.
Yinglun Xu
Bhuvesh Kumar
Jacob D. Abernethy
Published in:
CoRR (2023)
Keyphrases
multi agent
contextual bandit
upper confidence bound
greedy algorithm
news recommendation
cooperative
search algorithm
multiple agents
machine learning
dynamic programming
knowledge base
probabilistic model
visual features
resolve conflicts