Login / Signup
Balancing Risk and Reward: A Batched-Bandit Strategy for Automated Phased Release.
Yufan Li
Jialiang Mao
Iavor Bojinov
Published in:
NeurIPS (2023)
Keyphrases
</>
bandit problems
reinforcement learning
information systems
fully automated
decision making
decision trees
search algorithm
data driven
search strategy
semi automated
portfolio management