Asynchronous Local-SGD Training for Language Modeling.
Bo LiuRachita ChhapariaArthur DouillardSatyen KaleAndrei A. RusuJiajun ShenArthur SzlamMarc'Aurelio RanzatoPublished in: CoRR (2024)
Keyphrases
- language modeling
- language model
- information retrieval
- retrieval model
- cross lingual
- query expansion
- n gram
- stochastic gradient descent
- probabilistic model
- statistical language models
- text classification
- test collection
- training set
- loss function
- improvements in retrieval effectiveness
- word segmentation
- comparable corpora
- sentence retrieval
- relevance model
- metadata