Login / Signup

CMR Scaling Law: Predicting Critical Mixture Ratios for Continual Pre-training of Language Models.

Jiawei GuZacc YangChuanghao DingRui ZhaoFei Tan
Published in: CoRR (2024)
Keyphrases