Sign in

Tricks for Training Sparse Translation Models.

Dheeru DuaShruti BhosaleVedanuj GoswamiJames CrossMike LewisAngela Fan
Published in: NAACL-HLT (2022)
Keyphrases
  • probabilistic model
  • online learning
  • computational models
  • machine learning
  • least squares
  • training examples
  • test set
  • process model
  • autoregressive
  • structured prediction
  • elastic net