ServerlessLLM: Locality-Enhanced Serverless Inference for Large Language Models.

Yao Fu, Leyang Xue, Yeqi Huang, Andrei-Octavian Brabete, Dmitrii Ustiugov, Yuvraj Patel, Luo Mai
Published in: CoRR (2024)