Login / Signup
Approximate Thompson Sampling for Learning Linear Quadratic Regulators with O(√T) Regret.
Yeoneung Kim
Gihun Kim
Insoon Yang
Published in:
CoRR (2024)
Keyphrases
</>
online learning
learning algorithm
supervised learning
real time
image segmentation
reinforcement learning
video sequences
support vector
special case
dynamic programming
unsupervised learning
linear quadratic