Efficient Deweahter Mixture-of-Experts with Uncertainty-Aware Feature-Wise Linear Modulation.
Rongyu ZhangYulin LuoJiaming LiuHuanrui YangZhen DongDenis A. GudovskiyTomoyuki OkunoYohei NakataKurt KeutzerYuan DuShanghang ZhangPublished in: AAAI (2024)