Login / Signup

SeqVAE: Sequence variational autoencoder with policy gradient.

Ting GaoYidong CuiFanyu Ding
Published in: Appl. Intell. (2021)
Keyphrases
  • policy gradient
  • image segmentation
  • neural network
  • function approximation
  • optimal control
  • reinforcement learning
  • gradient method
  • actor critic
  • model free reinforcement learning
  • average reward