Login / Signup
Interpretability in the Wild: a Circuit for Indirect Object Identification in GPT-2 small.
Kevin Wang
Alexandre Variengien
Arthur Conmy
Buck Shlegeris
Jacob Steinhardt
Published in:
CoRR (2022)
Keyphrases
</>
object identification
image matching
object recognition
computer vision
object views
feature space
markov random field