Login / Signup

Approximate Thompson Sampling for Learning Linear Quadratic Regulators with O(√T) Regret.

Yeoneung KimGihun KimInsoon Yang
Published in: CoRR (2024)
Keyphrases
  • online learning
  • learning algorithm
  • supervised learning
  • real time
  • image segmentation
  • reinforcement learning
  • video sequences
  • support vector
  • special case
  • dynamic programming
  • unsupervised learning
  • linear quadratic