Nuanced Metrics for Measuring Unintended Bias with Real Data for Text Classification.
Daniel BorkanLucas DixonJeffrey SorensenNithum ThainLucy VassermanPublished in: WWW (Companion Volume) (2019)
Keyphrases
- text classification
- text mining
- naive bayes
- text data
- text categorization
- feature selection
- bag of words
- text documents
- synthetic data
- labeled data
- machine learning
- semantic features
- knn
- data cleaning
- text classifiers
- n gram
- document classification
- evaluation metrics
- data sets
- term frequency
- neural network
- databases
- database
- similarity metrics
- variance reduction
- text classification tasks
- multi label
- semantic information
- language modeling
- learning algorithm
- real world