Login / Signup
HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts.
Truong Do
Le Khiem
Quang Pham
TrungTin Nguyen
Thanh-Nam Doan
Binh Nguyen
Chenghao Liu
Savitha Ramasamy
Xiaoli Li
Steven C. H. Hoi
Published in:
EMNLP (2023)
Keyphrases
</>
mixture model
gaussian mixture model
efficient learning
neural network
high dimensional
structured prediction
bayesian networks
artificial neural networks
graphical models
online learning
bayesian inference
training process
training algorithm
reduced set