RT-LM: Uncertainty-Aware Resource Management for Real-Time Inference of Language Models.

Yufei Li, Zexin Li, Wei Yang, Cong Liu
Published in: CoRR (2023)
Keyphrases