Login / Signup
What is the long-run distribution of stochastic gradient descent? A large deviations analysis.
Waïss Azizian
Franck Iutzeler
Jérôme Malick
Panayotis Mertikopoulos
Published in:
CoRR (2024)
Keyphrases
</>
long run
large deviations
stochastic gradient descent
pairwise
loss function
queueing networks