Login / Signup
Pure Exploration Bandit Problem with General Reward Functions Depending on Full Distributions.
Siwei Wang
Wei Chen
Published in:
CoRR (2021)
Keyphrases
</>
reward function
pairwise
dynamic systems
state space
transition probabilities