• search
    search
  • reviewers
    reviewers
  • feeds
    feeds
  • assignments
    assignments
  • settings
  • logout

Looking Back on the Actor-Critic Architecture.

Andrew G. BartoRichard S. SuttonCharles W. Anderson
Published in: IEEE Trans. Syst. Man Cybern. Syst. (2021)
Keyphrases
  • actor critic
  • reinforcement learning
  • approximate dynamic programming
  • optimal control
  • temporal difference
  • neural network
  • gradient method
  • multi agent
  • simulated annealing
  • linear program