The Dataset Multiplicity Problem: How Unreliable Data Impacts Predictions.
Anna P. MeyerAws AlbarghouthiLoris D'AntoniPublished in: CoRR (2023)
Keyphrases
- data sets
- data processing
- machine learning
- original data
- image data
- data analysis
- experimental data
- synthetic data
- high quality
- database
- complex data
- data points
- data structure
- statistical methods
- input data
- training data
- real world
- data distribution
- missing data
- training dataset
- application domains
- benchmark datasets
- small number
- knowledge discovery
- probability distribution
- prior knowledge