Login / Signup

On the Convergence Rate of the Stochastic Gradient Descent (SGD) and application to a modified policy gradient for the Multi Armed Bandit.

Stefana AnitaGabriel Turinici
Published in: CoRR (2024)
Keyphrases