The Flan Collection: Designing Data and Methods for Effective Instruction Tuning.
Shayne LongpreLe HouTu VuAlbert WebsonHyung Won ChungYi TayDenny ZhouQuoc V. LeBarret ZophJason WeiAdam RobertsPublished in: CoRR (2023)
Keyphrases
- data sets
- data mining methods
- image data
- training data
- data analysis
- original data
- database
- data mining techniques
- synthetic data
- data processing
- significant improvement
- benchmark datasets
- knowledge discovery
- statistical methods
- data representations
- data reduction
- high quality
- noisy data
- prior knowledge
- raw data
- human experts
- data sources
- missing values
- experimental data
- attribute values
- document collections
- data objects
- labeled data
- data collection
- statistical tests
- statistical significance
- predictive model
- data points