Login / Signup

Pushing Mixture of Experts to the Limit: Extremely Parameter Efficient MoE for Instruction Tuning.

Ted ZadouriAhmet ÜstünArash AhmadianBeyza ErmisAcyr LocatelliSara Hooker
Published in: CoRR (2023)
Keyphrases
  • cost effective
  • parameter tuning
  • neural network
  • domain knowledge
  • computationally efficient
  • parameter values