Login / Signup

LLM in a flash: Efficient Large Language Model Inference with Limited Memory.

Keivan AlizadehIman MirzadehDmitry BelenkoKaren KhatamifardMinsik ChoCarlo C. Del MundoMohammad RastegariMehrdad Farajtabar
Published in: CoRR (2023)
Keyphrases