Login / Signup

Parm: Efficient Training of Large Sparsely-Activated Models with Dedicated Schedules.

Xinglin PanWenxiang LinShaohuai ShiXiaowen ChuWeinong SunBo Li
Published in: INFOCOM (2024)
Keyphrases