Accelerating Asynchronous Stochastic Gradient Descent for Neural Machine Translation.
Nikolay BogoychevMarcin Junczys-DowmuntKenneth HeafieldAlham Fikri AjiPublished in: CoRR (2018)
Keyphrases
- machine translation
- stochastic gradient descent
- loss function
- least squares
- step size
- matrix factorization
- random forests
- natural language processing
- support vector machine
- cross lingual
- cross language information retrieval
- information extraction
- target language
- machine translation system
- multiple kernel learning
- regularization parameter
- online algorithms
- statistical machine translation
- natural language
- weight vector
- importance sampling
- collaborative filtering
- semi supervised
- pairwise
- data mining
- image restoration
- linear combination
- semi supervised learning