We Need to Talk About Data: The Importance of Data Readiness in Natural Language Processing.
Fredrik OlssonMagnus SahlgrenPublished in: CoRR (2021)
Keyphrases
- data sets
- data collection
- data structure
- data analysis
- high quality
- training data
- data sources
- high dimensional data
- data processing
- complex data
- database
- synthetic data
- data mining techniques
- dimensionality reduction
- small number
- image data
- knowledge discovery
- data points
- probability distribution
- missing data
- experimental data
- network structure
- raw data
- data quality
- clustering algorithm