Stubborn Lexical Bias in Data and Models.
Sofia Serrano, Jesse Dodge, Noah A. Smith
Published in: ACL (Findings) (2023)
Keyphrases
- data sets
- accurate models
- data processing
- prior knowledge
- synthetic data
- raw data
- data collection
- statistical models
- image data
- probability distribution
- probabilistic model
- statistical analysis
- sensor data
- statistical methods
- complex data
- missing data
- database
- data distribution
- historical data
- knowledge discovery
- data points
- data analysis
- privacy preserving
- association rules
- missing values
- data structure
- training data
- original data
- database systems
- learning models
- machine learning
- predictive model
- natural language text
- learned models
- neural network