ChapGTP, ILLC's Attempt at Raising a BabyLM: Improving Data Efficiency by Automatic Task Formation.
Jaap JumeletMichael HannaMarianne de Heer KlootsAnna LangedijkCharlotte PouwOskar van der WalPublished in: CoRR (2023)
Keyphrases
- data sets
- image data
- synthetic data
- data collection
- data structure
- database
- data analysis
- data processing
- small number
- data sources
- end users
- sensor data
- input data
- data points
- high quality
- search engine
- learning algorithm
- test data
- multimedia data
- raw data
- data objects
- spatial data
- complex data
- knowledge discovery
- statistical analysis
- prior knowledge
- computational complexity
- training data
- data mining
- real time