Maybe Only 0.5% Data is Needed: A Preliminary Exploration of Low Training Data Instruction Tuning.
Hao ChenYiming ZhangQi ZhangHantao YangXiaomeng HuXuetao MaYifan YanggongJunbo ZhaoPublished in: CoRR (2023)
Keyphrases
- training data
- data sets
- data collection
- raw data
- high quality
- image data
- data structure
- data analysis
- test data
- synthetic data
- data processing
- knowledge discovery
- database
- prior knowledge
- database systems
- original data
- noisy data
- neural network
- statistical analysis
- experimental data
- spatial data
- data quality
- data distribution
- sensor data
- attribute values
- training examples
- training samples
- input data
- feature space
- decision trees
- multimedia
- feature selection
- databases