Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism.
Mohammad Shoeybi
Mostofa Patwary
Raul Puri
Patrick LeGresley
Jared Casper
Bryan Catanzaro
Published in:
CoRR (2019)
Keyphrases
language model
probabilistic model
language modeling
relevance model
information retrieval
mixture model
n-gram
speech recognition
test collection
document retrieval
translation model
machine learning
text mining
query expansion
web documents
scoring function