Transfer in Sequential Multi-Armed Bandits via Reward Samples.
Rahul N. R
Vaibhav Katewa
Published in:
ECC (2024)
Keyphrases
multi armed bandits
bandit problems
multi armed bandit
decision problems
reinforcement learning
transfer learning
training samples
special case
machine learning
training set
least squares