Data and its (dis)contents: A survey of dataset development and use in machine learning research.
Amandalynne Paullada, Inioluwa Deborah Raji, Emily M. Bender, Emily Denton, Alex Hanna
Published in: Patterns (2021)
Keyphrases
- machine learning
- database
- data sets
- data analysis
- data collection
- data processing
- knowledge discovery
- original data
- data sources
- case study
- data quality
- high quality
- training data
- metadata
- statistical analysis
- synthetic data
- end users
- image data
- neural network
- experimental data
- synthetic datasets
- computer science
- raw data
- computer systems
- input data
- natural language processing
- prior knowledge
- data structure