Login / Signup

EdgeMoE: Fast On-Device Inference of MoE-based Large Language Models.

Rongjie YiLiwei GuoShiyun WeiAo ZhouShangguang WangMengwei Xu
Published in: CoRR (2023)
Keyphrases