Login / Signup
Investigating the Impact of Action Representations in Policy Gradient Algorithms.
Jan Schneider
Pierre Schumacher
Daniel F. B. Häufle
Bernhard Schölkopf
Dieter Büchler
Published in:
CoRR (2023)
Keyphrases
</>
learning algorithm
natural gradient
machine learning
policy gradient
search algorithm
search space
support vector machine
worst case
gradient ascent