Sign in

MoEBERT: from BERT to Mixture-of-Experts via Importance-Guided Adaptation.

Simiao ZuoQingru ZhangChen LiangPengcheng HeTuo ZhaoWeizhu Chen
Published in: CoRR (2022)
Keyphrases
  • mixture model
  • relative importance
  • decision making
  • multimedia
  • expectation maximization
  • adaptation strategies
  • data mining
  • learning algorithm
  • computer vision
  • multi agent systems
  • domain experts