Login / Signup

Offline Multi-Policy Gradient for Latent Mixture Environments.

Xiaoguang LiXin ZhangLixin WangGe Yu
Published in: IEEE Access (2021)
Keyphrases
  • policy gradient
  • parametric optimization
  • latent variables
  • actor critic
  • neural network
  • dynamic environments
  • function approximation
  • gradient method
  • reinforcement learning
  • expectation maximization
  • optimal control