Increasing Entropy to Boost Policy Gradient Performance on Personalization Tasks.

Andrew Starnes Anton Dereventsov Clayton G. Webster

Published in: ICDM (Workshops) (2023)

Keyphrases

policy gradient
gradient method
model free reinforcement learning
neural network
control system
model checking
parametric optimization