Boosting Exploration in Actor-Critic Algorithms by Incentivizing Plausible Novel States.

Chayan Banerjee, Zhiyong Chen, Nasimul Noman

Published in: CDC (2023)

Keyphrases

learning algorithm
computational complexity
optimization problems
objective function
multi-agent systems
reinforcement learning
cost function
simulated annealing
learning tasks