"Diversity and Uncertainty in Moderation" are the Key to Data Selection for Multilingual Few-shot Transfer.
Shanu KumarSandipan DandapatMonojit ChoudhuryPublished in: NAACL-HLT (Findings) (2022)
Keyphrases
- database
- data collection
- data sets
- high quality
- sensor data
- synthetic data
- image data
- experimental data
- small number
- data points
- data sources
- data analysis
- prior knowledge
- data structure
- raw data
- uncertain data
- relational databases
- high dimensional
- data mining techniques
- input data
- digital libraries
- spatial data
- databases
- data distribution