Identity Term Sampling for Measuring Gender Bias in Training Data.
Nasim SobhaniSarah Jane DelanyPublished in: AICS (2022)
Keyphrases
- training data
- sampled data
- decision trees
- data sets
- training set
- classification accuracy
- sample size
- learning algorithm
- test data
- sample selection
- identity management
- random sampling
- class labels
- monte carlo
- skewed data
- learned from training data
- training instances
- naive bayes
- prior knowledge
- neural network
- generalization error
- training dataset
- sampling algorithm
- labeled data
- query expansion
- model selection
- domain knowledge
- sampling strategies