Login / Signup

Multi-armed Bandits with Generalized Temporally-Partitioned Rewards.

Ronald C. van den BroekRik LitjensTobias SagisNina VerbeekePratik Gajane
Published in: IDA (1) (2024)
Keyphrases
  • multi armed bandits
  • bandit problems
  • multi armed bandit
  • temporal information
  • decision problems