Sign in

An Option-Dependent Analysis of Regret Minimization Algorithms in Finite-Horizon Semi-Markov Decision Processes.

Gianluca DrappoAlberto Maria MetelliMarcello Restelli
Published in: CoRR (2023)
Keyphrases
  • regret minimization
  • finite horizon
  • computational complexity
  • nash equilibrium
  • optimal policy
  • machine learning
  • learning algorithm
  • cooperative
  • sufficient conditions
  • game theoretic