A Dominant Strategy Truthful, Deterministic Multi-Armed Bandit Mechanism with Logarithmic Regret.
Divya Padmanabhan
Satyanath Bhat
Prabuchandran K. J.
Shirish K. Shevade
Y. Narahari
Published in:
AAMAS (2017)
Keyphrases
regret bounds
multi armed bandit
multi armed bandits
lower bound
online learning
linear regression
mechanism design
upper bound
worst case
reinforcement learning
least squares
learning algorithm
bregman divergences
decentralized decision making