Yuan 2.0-M32: Mixture of Experts with Attention Router.
Shaohua Wu
Jiangang Luo
Xi Chen
Lingjun Li
Xudong Zhao
Tong Yu
Chao Wang
Yue Wang
Fei Wang
Weixu Qiao
Houbo He
Zeru Zhang
Zeyu Sun
Junxiong Mao
Chong Shen
Published in:
CoRR (2024)
Keyphrases
mixture model
visual attention
focus of attention
data sets
expectation maximization
databases
real world
artificial intelligence
computer vision
evolutionary algorithm
knowledge acquisition
gaussian mixture