ScaleBiO: Scalable Bilevel Optimization for LLM Data Reweighting.
Rui Pan, Jipeng Zhang, Xingyuan Pan, Renjie Pi, Xiaoyu Wang, Tong Zhang
Published in: CoRR (2024)
Keyphrases
- data analysis
- database
- data sets
- raw data
- data structure
- statistical analysis
- genetic algorithm
- optimization algorithm
- data processing
- input data
- data mining techniques
- data quality
- experimental data
- small number
- knowledge discovery
- data sources
- data model
- training data
- feature space
- synthetic data
- attribute values
- high quality
- original data
- website
- big data
- complex data