Tracing and Removing Data Errors in Natural Language Generation Datasets.
Faisal LadhakEsin DurmusTatsunori HashimotoPublished in: CoRR (2022)
Keyphrases
- data sets
- natural language generation
- raw data
- database
- data analysis
- training data
- test data
- data sources
- data mining techniques
- high dimensional data
- input data
- data points
- data structure
- artificial intelligence
- image data
- data processing
- data collection
- xml documents
- domain experts
- cooperative
- original data
- errors occur