Interpretability in the Wild: a Circuit for Indirect Object Identification in GPT-2 Small.
Kevin Ro Wang
Alexandre Variengien
Arthur Conmy
Buck Shlegeris
Jacob Steinhardt
Published in:
ICLR (2023)
Keyphrases
object identification
image matching
object recognition
object views
state space