Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning.
Ming LiYong ZhangShwai HeZhitao LiHongyu ZhaoJianzong WangNing ChengTianyi ZhouPublished in: CoRR (2024)
Keyphrases
- data sets
- data quality
- image data
- raw data
- data analysis
- data sources
- data collection
- data processing
- data structure
- small number
- complex data
- experimental data
- attribute values
- knowledge discovery
- data model
- high quality
- training data
- neural network
- input data
- data mining techniques
- computer systems
- statistical analysis
- synthetic data
- missing data
- data distribution
- original data
- database