Revisiting MoE and Dense Speed-Accuracy Comparisons for LLM Training.

Xianzhi Du, Tom Gunter, Xiang Kong, Mark Lee, Zirui Wang, Aonan Zhang, Nan Du, Ruoming Pang
Published in: CoRR (2024)
Keyphrases