Henry Sleight
Publication Activity (10 Years)
Years Active: 2024
Publications (10 Years): 3
Top Topics
Document Length
Behavior Recognition
Training Examples
Language Modelling
Top Venues
CoRR
Publications
Matthias Gerstgrasser, Rylan Schaeffer, Apratim Dey, Rafael Rafailov, Henry Sleight, John Hughes, Tomasz Korbak, Rajashree Agrawal, Dhruv Pai, Andrey Gromov, Daniel A. Roberts, Diyi Yang, David L. Donoho, Sanmi Koyejo
Is Model Collapse Inevitable? Breaking the Curse of Recursion by Accumulating Real and Synthetic Data.
CoRR (2024)
Abhay Sheshadri, Aidan Ewart, Phillip Guo, Aengus Lynch, Cindy Wu, Vivek Hebbar, Henry Sleight, Asa Cooper Stickland, Ethan Perez, Dylan Hadfield-Menell, Stephen Casper
Targeted Latent Adversarial Training Improves Robustness to Persistent Harmful Behaviors in LLMs.
CoRR (2024)
Rylan Schaeffer, Dan Valentine, Luke Bailey, James Chua, Cristóbal Eyzaguirre, Zane Durante, Joe Benton, Brando Miranda, Henry Sleight, John Hughes, Rajashree Agrawal, Mrinank Sharma, Scott Emmons, Sanmi Koyejo, Ethan Perez
When Do Universal Image Jailbreaks Transfer Between Vision-Language Models?
CoRR (2024)