MoEBERT: from BERT to Mixture-of-Experts via Importance-Guided Adaptation.
Simiao Zuo
Qingru Zhang
Chen Liang
Pengcheng He
Tuo Zhao
Weizhu Chen
Published in:
CoRR (2022)
Keyphrases
mixture model
relative importance
decision making
multimedia
expectation maximization
adaptation strategies
data mining
learning algorithm
computer vision
multi-agent systems
domain experts