Login / Signup

Impact of multi-armed bandit strategies on deep recurrent reinforcement learning.

Valentina ZangirolamiMatteo Borrotti
Published in: CoRR (2023)
Keyphrases
  • reinforcement learning
  • multi armed bandit
  • multi armed bandits
  • state space
  • model free
  • decentralized decision making
  • temporal difference
  • learning process
  • markov decision processes