On the Impact of Data Augmentation on Downstream Performance in Natural Language Processing.
Itsuki Okimura, Machel Reid, Makoto Kawano, Yutaka Matsuo
Published in: Insights@ACL (2022)
Keyphrases
- data sets
- natural language processing
- synthetic data
- prior knowledge
- database
- data collection
- data processing
- experimental data
- training data
- data structure
- data sources
- small number
- raw data
- statistical analysis
- data points
- data analysis
- complex data
- historical data
- textual data
- original data
- computational linguistics
- input data
- missing data
- data mining
- knowledge discovery
- Bayesian networks
- web pages