Learning Stochastic Optimal Policies via Gradient Descent.

Published in: IEEE Control. Syst. Lett. (2022)

Keyphrases