Metadata Archaeology: Unearthing Data Subsets by Leveraging Training Dynamics.
Shoaib Ahmed SiddiquiNitarshan RajkumarTegan MaharajDavid KruegerSara HookerPublished in: CoRR (2022)
Keyphrases
- data collection
- data analysis
- data sets
- original data
- metadata
- raw data
- training data
- database
- data processing
- data quality
- neural network
- missing data
- labelled data
- information resources
- e learning
- data structure
- data distribution
- spatial data
- virtual reality
- xml documents
- training examples
- training samples
- computer systems
- digital libraries
- high dimensional
- input data
- prior knowledge