Will we run out of data? An analysis of the limits of scaling datasets in Machine Learning.
Pablo Villalobos, Jaime Sevilla, Lennart Heim, Tamay Besiroglu, Marius Hobbhahn, Anson Ho
Published in: CoRR (2022)
Keyphrases
- data analysis
- data sets
- statistical analysis
- machine learning
- data collection
- raw data
- training data
- data analysis tasks
- database
- synthetic data
- learning algorithm
- data sources
- attribute values
- empirical data
- complex data
- original data
- background knowledge
- data points
- data processing
- image data
- probability distribution
- computer systems
- structured data
- knowledge discovery
- spatial data
- experimental data
- data acquisition
- pattern recognition
- knowledge acquisition
- data quality
- information extraction
- correlation analysis