Spuriosity Rankings: Sorting Data for Spurious Correlation Robustness.
Mazda MoayeriWenxiao WangSahil SinglaSoheil FeiziPublished in: CoRR (2022)
Keyphrases
- data sets
- raw data
- data analysis
- data collection
- database
- data processing
- databases
- synthetic data
- correlation analysis
- data quality
- experimental data
- data points
- data sources
- high quality
- neural network
- knowledge discovery
- small number
- data warehouse
- prior knowledge
- attribute values
- application domains
- similarity measure
- statistical methods
- learning algorithm
- noisy data
- genetic algorithm
- historical data