Login / Signup
Pseudorehearsal in actor-critic agents.
Vladimir Marochko
Leonard Johard
Manuel Mazzara
Published in:
CoRR (2017)
Keyphrases
</>
actor critic
multi agent systems
multi agent
reinforcement learning
cooperative
multiple agents
decision making
policy gradient
neuro fuzzy
incomplete information
approximate dynamic programming
optimal control
dynamic environments
single agent
average reward
gradient method
neural network