Machine learning data practices through a data curation lens: An evaluation framework.
Eshta BhardwajHarshit GujralSiyi WuCiara ZogheibTegan MaharajChristoph BeckerPublished in: FAccT (2024)
Keyphrases
- data sets
- machine learning
- complex data
- synthetic data
- image data
- data analysis
- high quality
- knowledge discovery
- data processing
- data sources
- data quality
- data distribution
- databases
- raw data
- data mining
- end users
- data points
- experimental data
- supervised learning
- case study
- statistical analysis
- data acquisition
- text mining
- labeled data
- database
- data collection
- text classification
- data mining techniques
- natural language processing
- small number
- neural network
- xml documents
- data structure
- decision trees
- training data