Login / Signup
Fast and Safe Switching with Guaranteed Regret: LQR Setting with Unknown Dynamics.
Jafar Abbaszadeh Chekan
Cédric Langbort
Published in:
CoRR (2023)
Keyphrases
</>
upper confidence bound
online learning
dynamic model
multi armed bandit
lower bound
multi class
dynamical systems
regret bounds
mobile robot
least squares
binary classification
recurrent networks
contextual bandit