Login / Signup

Grouped self-attention mechanism for a memory-efficient Transformer.

Bumjun JungYusuke MukutaTatsuya Harada
Published in: CoRR (2022)
Keyphrases