Login / Signup
Entropy Regularization with Discounted Future State Distribution in Policy Gradient Methods.
Riashat Islam
Raihan Seraj
Pierre-Luc Bacon
Doina Precup
Published in:
CoRR (2019)
Keyphrases
</>
state space
neural network
dynamic programming