Model-Based Reparameterization Policy Gradient Methods: Theory and Practical Algorithms.

Shenao Zhang Boyi Liu Zhaoran Wang Tuo Zhao

Published in: CoRR (2023)

Keyphrases

computational complexity
optimization problems
learning algorithm
multi agent
mathematical models