Unnatural Language Processing: Bridging the Gap Between Synthetic and Natural Language Data.
Alana MarzoevSamuel MaddenM. Frans KaashoekMichael J. CafarellaJacob AndreasPublished in: CoRR (2020)
Keyphrases
- language processing
- natural language
- data sets
- raw data
- synthetic data
- database
- natural language processing
- data quality
- data collection
- data processing
- human language technology
- image data
- data points
- real world
- high quality
- training data
- neural network
- end users
- natural language text
- learning algorithm
- artificial intelligence
- experimental data
- sensor data
- missing data
- high dimensional data
- statistical analysis
- data structure
- input data
- relational databases
- data sources
- knowledge discovery