Login / Signup
Graphical Object-Centric Actor-Critic.
Leonid Ugadiarov
Aleksandr I. Panov
Published in:
CoRR (2023)
Keyphrases
</>
actor critic
reinforcement learning
function approximation
temporal difference
simulated annealing
policy gradient