Sign in

Flan-MoE: Scaling Instruction-Finetuned Language Models with Sparse Mixture of Experts.

Sheng ShenLe HouYanqi ZhouNan DuShayne LongpreJason WeiHyung Won ChungBarret ZophWilliam FedusXinyun ChenTu VuYuexin WuWuyang ChenAlbert WebsonYunxuan LiVincent ZhaoHongkun YuKurt KeutzerTrevor DarrellDenny Zhou
Published in: CoRR (2023)
Keyphrases