Adversarial Combinatorial Bandits with General Non-linear Reward Functions.
Xi Chen
Yanjun Han
Yining Wang
Published in:
CoRR (2021)
Keyphrases
special case
reward function
Markov decision processes
multi-agent
hidden Markov models
state space