Login / Signup
SeqVAE: Sequence variational autoencoder with policy gradient.
Ting Gao
Yidong Cui
Fanyu Ding
Published in:
Appl. Intell. (2021)
Keyphrases
</>
policy gradient
image segmentation
neural network
function approximation
optimal control
reinforcement learning
gradient method
actor critic
model free reinforcement learning
average reward