Time Discretization-Invariant Safe Action Repetition for Policy Gradient Methods.

Seohong Park Jaekyeom Kim Gunhee Kim

Published in: CoRR (2021)

Keyphrases

policy gradient methods
natural actor critic
machine learning
neural network
naive bayes classifier
policy gradient