Fiddler: CPU-GPU Orchestration for Fast Inference of Mixture-of-Experts Models.

Keisuke Kamahori, Yile Gu, Kan Zhu, Baris Kasikci
Published in: CoRR (2024)
Keyphrases
  • random fields
  • real time
  • probabilistic model
  • statistical model
  • statistical models
  • probabilistic inference
  • parallel implementation