Convergence Proof for Actor-Critic Methods Applied to PPO and RUDDER.
Markus Holzleitner
Lukas Gruber
Jose A. Arjona-Medina
Johannes Brandstetter
Sepp Hochreiter
Published in:
CoRR (2020)
Keyphrases
convergence proof
actor critic
fuzzy logic
least squares
optimization methods