Fast Distributed Inference Serving for Large Language Models.

Bingyang Wu, Yinmin Zhong, Zili Zhang, Gang Huang, Xuanzhe Liu, Xin Jin
Published in: CoRR (2023)