Interpreting Attention Layer Outputs with Sparse Autoencoders.

Connor Kissane Robert Krzyzanowski Joseph Isaac Bloom Arthur Conmy Neel Nanda

Published in: CoRR (2024)

Keyphrases

denoising
sparse data
compressed sensing
neural network
high dimensional
compressive sensing
hidden nodes
restricted boltzmann machine
database
search engine
video sequences
artificial neural networks
input output
multi layer
focus of attention
output layer