Learning-Based Control Policy and Regret Analysis for Online Quadratic Optimization With Asymmetric Information Structure.
Cheng Tan
Lin Yang
Wing Shing Wong
Published in:
IEEE Trans. Cybern. (2022)
Keyphrases
online learning
quadratic optimization
learning algorithm
reinforcement learning
supervised learning
decision trees
active learning
learning tasks
long run
control policy