Sign in
Arthur Conmy
Publication Activity (10 Years)
Years Active: 2021-2023
Publications (10 Years): 9
Top Topics
Emission Tomography
Regularization Methods
Semi Automated
Object Identification
Top Venues
CoRR
ICASSP
ICLR
NeurIPS
</>
Publications
</>
Rhys Gould
,
Euan Ong
,
George Ogden
,
Arthur Conmy
Successor Heads: Recurring, Interpretable Attention Heads In The Wild.
CoRR
(2023)
Kevin Ro Wang
,
Alexandre Variengien
,
Arthur Conmy
,
Buck Shlegeris
,
Jacob Steinhardt
Interpretability in the Wild: a Circuit for Indirect Object Identification in GPT-2 Small.
ICLR
(2023)
Callum McDougall
,
Arthur Conmy
,
Cody Rushing
,
Thomas McGrath
,
Neel Nanda
Copy Suppression: Comprehensively Understanding an Attention Head.
CoRR
(2023)
Arthur Conmy
,
Augustine N. Mavor-Parker
,
Aengus Lynch
,
Stefan Heimersheim
,
Adrià Garriga-Alonso
Towards Automated Circuit Discovery for Mechanistic Interpretability.
CoRR
(2023)
Arthur Conmy
,
Augustine N. Mavor-Parker
,
Aengus Lynch
,
Stefan Heimersheim
,
Adrià Garriga-Alonso
Towards Automated Circuit Discovery for Mechanistic Interpretability.
NeurIPS
(2023)
Aaquib Syed
,
Can Rager
,
Arthur Conmy
Attribution Patching Outperforms Automated Circuit Discovery.
CoRR
(2023)
Kevin Wang
,
Alexandre Variengien
,
Arthur Conmy
,
Buck Shlegeris
,
Jacob Steinhardt
Interpretability in the Wild: a Circuit for Indirect Object Identification in GPT-2 small.
CoRR
(2022)
Arthur Conmy
,
Subhadip Mukherjee
,
Carola-Bibiane Schönlieb
Stylegan-Induced Data-Driven Regularization for Inverse Problems.
ICASSP
(2022)
Arthur Conmy
,
Subhadip Mukherjee
,
Carola-Bibiane Schönlieb
StyleGAN-induced data-driven regularization for inverse problems.
CoRR
(2021)