Reclaiming the Digital Commons: A Public Data Trust for Training Data.
Alan ChanHerbie BradleyNitarshan RajkumarPublished in: CoRR (2023)
Keyphrases
- training data
- data sets
- data collection
- training examples
- prior knowledge
- raw data
- test data
- noisy data
- database
- data distribution
- data sources
- knowledge discovery
- data structure
- data mining
- labelled data
- data quality
- multimedia data
- network structure
- training samples
- data mining techniques
- learning algorithm
- data analysis