Finding Optimally Robust Data Mixtures via Concave Maximization.
Anvith ThudiChris J. MaddisonPublished in: CoRR (2024)
Keyphrases
- data sets
- data processing
- database
- high quality
- experimental data
- data quality
- training data
- raw data
- prior knowledge
- computer systems
- input data
- historical data
- complex data
- data collection
- end users
- data sources
- objective function
- information systems
- databases
- small number
- probability distribution
- statistical analysis
- multimedia data
- original data
- data objects
- database systems
- erroneous data