Machine Learning Data Practices through a Data Curation Lens: An Evaluation Framework.
Eshta BhardwajHarshit GujralSiyi WuCiara ZogheibTegan MaharajChristoph BeckerPublished in: CoRR (2024)
Keyphrases
- data sets
- machine learning
- data collection
- original data
- synthetic data
- knowledge discovery
- data analysis
- complex data
- data processing
- image data
- database
- data sources
- raw data
- cloud computing
- noisy data
- data objects
- data mining
- data distribution
- sensor data
- information retrieval
- statistical analysis
- relational databases
- computer vision
- training data
- data mining techniques
- text mining
- information extraction
- high quality
- computer science
- prior knowledge