Login / Signup

Tender: Accelerating Large Language Models via Tensor Decomposition and Runtime Requantization.

Jungi LeeWonbeom LeeJaewoong Sim
Published in: ISCA (2024)
Keyphrases