Login / Signup
Critical Data Size of Language Models from a Grokking Perspective.
Xuekai Zhu
Yao Fu
Bowen Zhou
Zhouhan Lin
Published in:
CoRR (2024)
Keyphrases
</>
language model
training data
knowledge discovery
information retrieval
probabilistic model
n gram
speech recognition
language modeling
language modelling