Actor-Critic Sequence Generation for Relative Difference Captioning.
Zhengcong Fei
Published in:
ICMR (2020)
Keyphrases
actor critic
reinforcement learning
optimal control
approximate dynamic programming
temporal difference
multi agent
control system
function approximation
reinforcement learning algorithms