FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient Finetuning.

Xupeng Miao, Gabriele Oliaro, Xinhao Cheng, Mengdi Wu, Colin Unger, Zhihao Jia
Published in: CoRR (2024)
Keyphrases