Login / Signup

Actor-Critic Sequence Generation for Relative Difference Captioning.

Zhengcong Fei
Published in: ICMR (2020)
Keyphrases
  • actor critic
  • reinforcement learning
  • optimal control
  • approximate dynamic programming
  • temporal difference
  • multi agent
  • control system
  • function approximation
  • reinforcement learning algorithms