Tom Lieberum
Publication Activity (10 Years)
Years Active: 2021-2023
Publications (10 Years): 5
Top Topics
Human Learning
Precision Recall
Rule Base
Evaluation Measures
Top Venues
CoRR
ICLR
NeurIPS (Competition and Demos)
Publications
Tom Lieberum, Matthew Rahtz, János Kramár, Neel Nanda, Geoffrey Irving, Rohin Shah, Vladimir Mikulik
Does Circuit Analysis Interpretability Scale? Evidence from Multiple Choice Capabilities in Chinchilla.
CoRR (2023)
Neel Nanda, Lawrence Chan, Tom Lieberum, Jess Smith, Jacob Steinhardt
Progress measures for grokking via mechanistic interpretability.
ICLR (2023)
Neel Nanda, Lawrence Chan, Tom Lieberum, Jess Smith, Jacob Steinhardt
Progress measures for grokking via mechanistic interpretability.
CoRR (2023)
Rohin Shah, Steven H. Wang, Cody Wild, Stephanie Milani, Anssi Kanervisto, Vinicius G. Goecks, Nicholas R. Waytowich, David Watkins-Valls, Bharat Prakash, Edmund Mills, Divyansh Garg, Alexander Fries, Alexandra Souly, Jun Shern Chan, Daniel del Castillo, Tom Lieberum
Retrospective on the 2021 BASALT Competition on Learning from Human Feedback.
CoRR (2022)
Rohin Shah, Steven H. Wang, Cody Wild, Stephanie Milani, Anssi Kanervisto, Vinicius G. Goecks, Nicholas R. Waytowich, David Watkins-Valls, Bharat Prakash, Edmund Mills, Divyansh Garg, Alexander Fries, Alexandra Souly, Jun Shern Chan, Daniel del Castillo, Tom Lieberum
Retrospective on the 2021 MineRL BASALT Competition on Learning from Human Feedback.
NeurIPS (Competition and Demos) (2021)