Login / Signup

Dictionary Learning Improves Patch-Free Circuit Discovery in Mechanistic Interpretability: A Case Study on Othello-GPT.

Zhengfu HeXuyang GeQiong TangTianxiang SunQinyuan ChengXipeng Qiu
Published in: CoRR (2024)
Keyphrases