Login / Signup
Data Mixing Laws: Optimizing Data Mixtures by Predicting Language Modeling Performance.
Jiasheng Ye
Peiju Liu
Tianxiang Sun
Yunhua Zhou
Jun Zhan
Xipeng Qiu
Published in:
CoRR (2024)
Keyphrases
</>
data mining
data analysis
knowledge discovery
high dimensional data
language modeling
information retrieval
data points
mixture model