Login / Signup
Ensemble-Instruct: Generating Instruction-Tuning Data with a Heterogeneous Mixture of LMs.
Young-Suk Lee
Md. Arafat Sultan
Yousef El-Kurdi
Tahira Naseem Asim Munawar
Radu Florian
Salim Roukos
Ramón Fernandez Astudillo
Published in:
CoRR (2023)
Keyphrases
</>
training data
data processing
data sets
data analysis
data collection
original data
machine learning
database
data structure
data sources
synthetic data
neural network
data distribution
discrete data
heterogeneous sources
missing data
mixture model
data points
high quality