Sign in

Relative Entropy of Correct Proximal Policy Optimization Algorithms with Modified Penalty Factor in Complex Environment.

Weimin ChenKelvin Kian Loong WongSifan LongZhili Sun
Published in: Entropy (2022)
Keyphrases
  • complex environments
  • relative entropy
  • learning algorithm
  • stochastic gradient
  • exponentiated gradient
  • sample size