Towards a statistical theory of data selection under weak supervision.
Germain KolossovAndrea MontanariPulkit TandonPublished in: CoRR (2023)
Keyphrases
- data sets
- statistical methods
- missing data
- data structure
- data analysis
- statistical analysis
- computer systems
- synthetic data
- data sources
- probability distribution
- data points
- statistical data
- original data
- data distribution
- data processing
- input data
- relational databases
- image data
- labeled data
- high dimensional
- high quality
- application domains
- training data
- social networks
- raw data
- information retrieval
- complex data
- statistical significance
- statistical information
- database