Login / Signup

LLMCad: Fast and Scalable On-device Large Language Model Inference.

Daliang XuWangsong YinXin JinYing ZhangShiyun WeiMengwei XuXuanzhe Liu
Published in: CoRR (2023)
Keyphrases