NLU on Data Diets: Dynamic Data Subset Selection for NLP Classification Tasks.
Jean-Michel Attendu, Jean-Philippe Corbeil
Published in: CoRR (2023)
Keyphrases
- data sets
- data sources
- data collection
- data processing
- statistical analysis
- synthetic data
- prior knowledge
- data analysis
- spatial data
- high quality
- raw data
- data structure
- knowledge discovery
- data quality
- privacy preserving
- experimental data
- question answering
- complex data
- small number
- data points
- xml documents
- search algorithm
- optimal solution
- metadata
- search engine
- machine learning