Generating Data to Mitigate Spurious Correlations in Natural Language Inference Datasets.
Yuxiang Wu, Matt Gardner, Pontus Stenetorp, Pradeep Dasigi
Published in: CoRR (2022)
Keyphrases
- data sets
- raw data
- data processing
- data analysis
- natural language
- synthetic data
- high quality
- complex data
- data quality
- data collection
- database
- image data
- experimental data
- databases
- original data
- small number
- semi supervised
- knowledge discovery
- data mining algorithms
- data distribution
- knowledge base
- training data
- data mining tasks
- training dataset
- experimental conditions