Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costs.

Enshu Liu, Junyi Zhu, Zinan Lin, Xuefei Ning, Matthew B. Blaschko, Shengen Yan, Guohao Dai, Huazhong Yang, Yu Wang
Published in: CoRR (2024)