Looking Back on the Actor-Critic Architecture.
Andrew G. Barto
Richard S. Sutton
Charles W. Anderson
Published in:
IEEE Trans. Syst. Man Cybern. Syst. (2021)
Keyphrases
actor critic
reinforcement learning
approximate dynamic programming
optimal control
temporal difference
neural network
gradient method
multi agent
simulated annealing
linear program