Yuan 2.0-M32: Mixture of Experts with Attention Router.

Shaohua Wu, Jiangang Luo, Xi Chen, Lingjun Li, Xudong Zhao, Tong Yu, Chao Wang, Yue Wang, Fei Wang, Weixu Qiao, Houbo He, Zeru Zhang, Zeyu Sun, Junxiong Mao, Chong Shen
Published in: CoRR (2024)
Keyphrases
  • mixture model
  • visual attention
  • focus of attention
  • data sets
  • expectation maximization
  • databases
  • real world
  • artificial intelligence
  • computer vision
  • evolutionary algorithm
  • knowledge acquisition
  • gaussian mixture