Login / Signup
A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms.
Shangtong Zhang
Romain Laroche
Harm van Seijen
Shimon Whiteson
Remi Tachet des Combes
Published in:
AAMAS (2022)
Keyphrases
</>
learning algorithm
neural network
reinforcement learning
computational complexity
support vector machine
optimization problems
approximate dynamic programming