Login / Signup
Relative Entropy of Correct Proximal Policy Optimization Algorithms with Modified Penalty Factor in Complex Environment.
Weimin Chen
Kelvin Kian Loong Wong
Sifan Long
Zhili Sun
Published in:
Entropy (2022)
Keyphrases
</>
complex environments
relative entropy
learning algorithm
stochastic gradient
exponentiated gradient
sample size