NLU on Data Diets: Dynamic Data Subset Selection for NLP Classification Tasks.
Jean-Michel AttenduJean-Philippe CorbeilPublished in: SustaiNLP (2023)
Keyphrases
- natural language
- data sets
- information extraction
- original data
- raw data
- data quality
- database
- synthetic data
- data sources
- data structure
- artificial neural networks
- prior knowledge
- data analysis
- high quality
- experimental data
- databases
- data collection
- data points
- knowledge discovery
- sensor data
- genetic algorithm ga
- decision trees
- learning algorithm