Data-Juicer: A One-Stop Data Processing System for Large Language Models.
Daoyuan ChenYilun HuangZhijian MaHesen ChenXuchen PanCe GeDawei GaoYuexiang XieZhaoyang LiuJinyang GaoYaliang LiBolin DingJingren ZhouPublished in: SIGMOD Conference Companion (2024)