Sign in

A Fast Optimization View: Reformulating Single Layer Attention in LLM Based on Tensor and SVM Trick, and Solving It in Matrix Multiplication Time.

Yeqi GaoZhao SongWeixin WangJunze Yin
Published in: CoRR (2023)
Keyphrases