On Reality and the Limits of Language Data.
Nigel H. CollierFangyu LiuEhsan ShareghiPublished in: CoRR (2022)
Keyphrases
- data sets
- data collection
- data analysis
- prior knowledge
- original data
- application domains
- training data
- data processing
- data quality
- data structure
- databases
- small number
- probability distribution
- xml documents
- data points
- high quality
- data mining algorithms
- web services
- experimental data
- test data
- raw data
- website
- data objects
- machine learning
- big data
- complex data