Rectified Pessimistic-Optimistic Learning for Stochastic Continuum-armed Bandit with Constraints.

Hengquan Guo Zhu Qi Xin Liu

Published in: L4DC (2023)

Keyphrases

learning algorithm
learning process
learning systems
incremental learning
monte carlo
learning automata
random sampling
learning tasks
background knowledge
knowledge acquisition
markov chain
state space
stereo images
learning analytics
learning community
multi agent
machine learning