Login / Signup
Trust Region Bounds for Decentralized PPO Under Non-stationarity.
Mingfei Sun
Sam Devlin
Jacob Beck
Katja Hofmann
Shimon Whiteson
Published in:
AAMAS (2023)
Keyphrases
</>
trust region
global optimum
column generation
optimization methods
upper bound
log likelihood
hessian matrix
newton method
levenberg marquardt
lower bound
line search
least squares
mean shift
optimization problems
sample size
branch and bound
optimal solution
global convergence
support vector