Login / Signup
Self-Influence Guided Data Reweighting for Language Model Pre-training.
Megh Thakkar
Tolga Bolukbasi
Sriram Ganapathy
Shikhar Vashishth
Sarath Chandar
Partha Talukdar
Published in:
EMNLP (2023)
Keyphrases
</>
language model
language modeling
n gram
training set
high dimensional
co occurrence
cross lingual