Data Mixing Laws: Optimizing Data Mixtures by Predicting Language Modeling Performance.

Jiasheng Ye, Peiju Liu, Tianxiang Sun, Yunhua Zhou, Jun Zhan, Xipeng Qiu
Published in: CoRR (2024)
Keyphrases
  • data mining
  • data analysis
  • knowledge discovery
  • high dimensional data
  • language modeling
  • information retrieval
  • data points
  • mixture model