Login / Signup
Rectified Pessimistic-Optimistic Learning for Stochastic Continuum-armed Bandit with Constraints.
Hengquan Guo
Zhu Qi
Xin Liu
Published in:
L4DC (2023)
Keyphrases
</>
learning algorithm
learning process
learning systems
incremental learning
monte carlo
learning automata
random sampling
learning tasks
background knowledge
knowledge acquisition
markov chain
state space
stereo images
learning analytics
learning community
multi agent
machine learning