Towards a statistical theory of data selection under weak supervision.
Germain KolossovAndrea MontanariPulkit TandonPublished in: ICLR (2024)
Keyphrases
- data sets
- statistical analysis
- data collection
- original data
- experimental data
- synthetic data
- high quality
- noisy data
- data points
- raw data
- statistical inference
- data processing
- database
- high dimensional data
- data analysis
- neural network
- feature selection
- decision trees
- data objects
- data distribution
- application domains
- missing data
- small number
- end users
- high dimensional