Login / Signup
CUP: Critic-Guided Policy Reuse.
Jin Zhang
Siyuan Li
Chongjie Zhang
Published in:
CoRR (2022)
Keyphrases
</>
policy gradient
actor critic
optimal policy
software reuse
genetic algorithm
expected cost
approximate dynamic programming
natural actor critic
machine learning
decision making
multi agent
temporal difference
asymptotically optimal
gradient method