Transfer in Sequential Multi-armed Bandits via Reward Samples.
Rahul N. R
Vaibhav Katewa
Published in:
CoRR (2024)
Keyphrases
multi-armed bandits
bandit problems
multi-armed bandit
decision problems
reinforcement learning
training samples
transfer learning
decision trees
least squares
influence diagrams