Login / Signup
Non-Stationary Bandits under Recharging Payoffs: Improved Planning with Sublinear Regret.
Orestis Papadigenopoulos
Constantine Caramanis
Sanjay Shakkottai
Published in:
CoRR (2022)
Keyphrases
</>
non stationary
adaptive algorithms
game theory
autoregressive
white noise
multi armed bandits
lower bound
online learning
empirical mode decomposition
worst case
loss function
planning problems
stock price
game theoretic
stochastic systems
multi armed bandit problems
regret bounds
hidden markov models