Sign in
Darshil Doshi
Publication Activity (10 Years)
Years Active: 2021-2023
Publications (10 Years): 4
Top Topics
Stable Models
Automatic Initialization
General Theory
Belief Functions
Top Venues
CoRR
NeurIPS
</>
Publications
</>
Darshil Doshi
,
Tianyu He
,
Andrey Gromov
Critical Initialization of Wide and Deep Neural Networks using Partial Jacobians: General Theory and Applications.
NeurIPS
(2023)
Darshil Doshi
,
Aritra Das
,
Tianyu He
,
Andrey Gromov
To grok or not to grok: Disentangling generalization and memorization on corrupted algorithmic datasets.
CoRR
(2023)
Tianyu He
,
Darshil Doshi
,
Andrey Gromov
AutoInit: Automatic Initialization via Jacobian Tuning.
CoRR
(2022)
Darshil Doshi
,
Tianyu He
,
Andrey Gromov
Critical initialization of wide and deep neural networks through partial Jacobians: general theory and applications to LayerNorm.
CoRR
(2021)