Login / Signup

Combining Global Sparse Gradients with Local Gradients in Distributed Neural Network Training.

Alham Fikri AjiKenneth HeafieldNikolay Bogoychev
Published in: EMNLP/IJCNLP (1) (2019)
Keyphrases
  • neural network training
  • neural network
  • machine learning
  • artificial intelligence
  • optimal solution
  • multi objective
  • linear combination