Approximate Thompson Sampling for Learning Linear Quadratic Regulators with O(√T) Regret.

Yeoneung Kim, Gihun Kim, Insoon Yang

Published in: CoRR (2024)

Keyphrases

online learning
learning algorithm
supervised learning
real time
image segmentation
reinforcement learning
video sequences
support vector
special case
dynamic programming
unsupervised learning
linear quadratic