Login / Signup

A model-based reinforcement learning method based on conditional generative adversarial networks.

Tingting ZhaoYing WangGuixi LiLe KongYarui ChenYuan WangNing XieJucheng Yang
Published in: Pattern Recognit. Lett. (2021)
Keyphrases
  • prior knowledge
  • dynamic programming
  • objective function
  • probabilistic model
  • em algorithm