Stubborn Lexical Bias in Data and Models.
Sofia Serrano, Jesse Dodge, Noah A. Smith
Published in: CoRR (2023)
Keyphrases
- experimental data
- data sets
- data processing
- training data
- data collection
- accurate models
- original data
- synthetic data
- small number
- high quality
- historical data
- data quality
- raw data
- statistical models
- evolutionary algorithm
- database
- prior knowledge
- image data
- missing data
- data points
- data distribution
- databases
- input data
- data structure
- domain specific
- probability distribution
- predictive model
- learned models