PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU.

Yixin Song, Zeyu Mi, Haotong Xie, Haibo Chen
Published in: CoRR (2023)
Keyphrases