Login / Signup
Mansheej Paul
Publication Activity (10 Years)
Years Active: 2020-2024
Publications (10 Years): 13
Top Topics
Deep Learning
Run Length
Reference Models
Support Vector Regression
Top Venues
CoRR
NeurIPS
ICLR
</>
Publications
</>
Cody Blakeney
,
Mansheej Paul
,
Brett W. Larsen
,
Sean Owen
,
Jonathan Frankle
Does your data spark joy? Performance gains from domain upsampling at the end of training.
CoRR
(2024)
Zachary Ankner
,
Cody Blakeney
,
Kartik Sreenivasan
,
Max Marion
,
Matthew L. Leavitt
,
Mansheej Paul
Perplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference Models.
CoRR
(2024)
Dan Biderman
,
Jose Javier Gonzalez Ortiz
,
Jacob Portes
,
Mansheej Paul
,
Philip Greengard
,
Connor Jennings
,
Daniel King
,
Sam Havens
,
Vitaliy Chiley
,
Jonathan Frankle
,
Cody Blakeney
,
John P. Cunningham
LoRA Learns Less and Forgets Less.
CoRR
(2024)
Allan Raventós
,
Mansheej Paul
,
Feng Chen
,
Surya Ganguli
Pretraining task diversity and the emergence of non-Bayesian in-context learning for regression.
CoRR
(2023)
Mansheej Paul
,
Feng Chen
,
Brett W. Larsen
,
Jonathan Frankle
,
Surya Ganguli
,
Gintare Karolina Dziugaite
Unmasking the Lottery Ticket Hypothesis: What's Encoded in a Winning Ticket's Mask?
ICLR
(2023)
Allan Raventós
,
Mansheej Paul
,
Feng Chen
,
Surya Ganguli
Pretraining task diversity and the emergence of non-Bayesian in-context learning for regression.
NeurIPS
(2023)
Mansheej Paul
,
Feng Chen
,
Brett W. Larsen
,
Jonathan Frankle
,
Surya Ganguli
,
Gintare Karolina Dziugaite
Unmasking the Lottery Ticket Hypothesis: What's Encoded in a Winning Ticket's Mask?
CoRR
(2022)
Mansheej Paul
,
Brett W. Larsen
,
Surya Ganguli
,
Jonathan Frankle
,
Gintare Karolina Dziugaite
Lottery Tickets on a Data Diet: Finding Initializations with Sparse Trainable Networks.
CoRR
(2022)
Mansheej Paul
,
Brett W. Larsen
,
Surya Ganguli
,
Jonathan Frankle
,
Gintare Karolina Dziugaite
Lottery Tickets on a Data Diet: Finding Initializations with Sparse Trainable Networks.
NeurIPS
(2022)
Mansheej Paul
,
Surya Ganguli
,
Gintare Karolina Dziugaite
Deep Learning on a Data Diet: Finding Important Examples Early in Training.
NeurIPS
(2021)
Mansheej Paul
,
Surya Ganguli
,
Gintare Karolina Dziugaite
Deep Learning on a Data Diet: Finding Important Examples Early in Training.
CoRR
(2021)
Stanislav Fort
,
Gintare Karolina Dziugaite
,
Mansheej Paul
,
Sepideh Kharaghani
,
Daniel M. Roy
,
Surya Ganguli
Deep learning versus kernel learning: an empirical study of loss landscape geometry and the time evolution of the Neural Tangent Kernel.
NeurIPS
(2020)
Stanislav Fort
,
Gintare Karolina Dziugaite
,
Mansheej Paul
,
Sepideh Kharaghani
,
Daniel M. Roy
,
Surya Ganguli
Deep learning versus kernel learning: an empirical study of loss landscape geometry and the time evolution of the Neural Tangent Kernel.
CoRR
(2020)