Login / Signup
Exploiting Inter-Layer Expert Affinity for Accelerating Mixture-of-Experts Model Inference.
Jinghan Yao
Quentin Anthony
Aamir Shafi
Hari Subramoni
Dhabaleswar K. Panda
Published in:
CoRR (2024)
Keyphrases
</>
probabilistic model
multiscale
image compression
energy function
video streams