Zero-Shot Data Maps. Efficient Dataset Cartography Without Model Training.
Angelo Basile, Marc Franco-Salvador, Paolo Rosso
Published in: EMNLP (Findings) (2023)
Keyphrases
- data sets
- experimental data
- prior knowledge
- simulation data
- database
- high level
- input data
- computational model
- data processing
- measured data
- empirical data
- probability distribution
- small number
- high quality
- training data
- mathematical model
- data quality
- original data
- network structure
- test data
- predictive model
- statistical analysis
- synthetic data
- data mining
- data analysis
- data collection
- image data
- knowledge discovery
- data sources
- data points
- probabilistic model
- labelled data
- neural network
- learning models
- expert knowledge
- raw data
- statistical model
- spatial data
- EM algorithm
- high dimensional data