The Fine Line: Navigating Large Language Model Pretraining with Down-streaming Capability Analysis.
Chen YangJunzhuo LiXinyao NiuXinrun DuSongyang GaoHaoran ZhangZhaoliang ChenXingwei QuRuibin YuanYizhi LiJiaheng LiuStephen W. HuangShawn YueWenhu ChenJie FuGe ZhangPublished in: CoRR (2024)