Banker Online Mirror Descent: A Universal Approach for Delayed Online Bandit Learning.
Jiatai Huang
Yan Dai
Longbo Huang
Published in:
ICML (2023)
Keyphrases
online learning
learning process
passive aggressive
learning algorithm
online training
online environment
learning systems
hybrid learning
learning tasks
learning problems
reinforcement learning
e-learning
neural network
real time
learning analytics
case study
bandit problems