Tensor Attention Training: Provably Efficient Learning of Higher-order Transformers.

Jiuxiang Gu, Yingyu Liang, Zhenmei Shi, Zhao Song, Yufa Zhou
Published in: CoRR (2024)
Keyphrases