Boosting Exploration in Actor-Critic Algorithms by Incentivizing Plausible Novel States.

Chayan Banerjee, Zhiyong Chen, Nasimul Noman
Published in: CDC (2023)
Keyphrases
  • learning algorithm
  • computational complexity
  • optimization problems
  • objective function
  • multi-agent systems
  • reinforcement learning
  • cost function
  • simulated annealing
  • learning tasks