Login / Signup
Self-Evolved Diverse Data Sampling for Efficient Instruction Tuning.
Shengguang Wu
Keming Lu
Benfeng Xu
Junyang Lin
Qi Su
Chang Zhou
Published in:
CoRR (2023)
Keyphrases
</>
data sets
data sources
database
prior knowledge
data collection
raw data
training data
data structure
data analysis
high quality
image data
input data
noisy data
data distribution
original data
missing data
synthetic data
statistical analysis
computer systems
data points
probability distribution
multimedia
databases