Convergence Proof for Actor-Critic Methods Applied to PPO and RUDDER.
Markus Holzleitner
Lukas Gruber
José Antonio Arjona-Medina
Johannes Brandstetter
Sepp Hochreiter
Published in:
Trans. Large Scale Data Knowl. Centered Syst. (2021)
Keyphrases
convergence proof
reinforcement learning
actor critic
multi agent