Login / Signup
Towards MoE Deployment: Mitigating Inefficiencies in Mixture-of-Expert (MoE) Inference.
Haiyang Huang
Newsha Ardalani
Anna Sun
Liu Ke
Hsien-Hsin S. Lee
Anjali Sridhar
Shruti Bhosale
Carole-Jean Wu
Benjamin Lee
Published in:
CoRR (2023)
Keyphrases
</>
inference process
expert systems
probabilistic inference
mixture model
gaussian mixture model
efficient learning
social networks
knowledge base
image segmentation
bayesian networks
information extraction
domain specific
expert knowledge
inference engine