Towards Schema Inference for Data Lakes.
Nour AlhammadAlex BogatuNorman W. PatonPublished in: CoRR (2022)
Keyphrases
- database
- data structure
- data analysis
- statistical analysis
- small number
- original data
- raw data
- data collection
- training data
- high quality
- synthetic data
- data processing
- image data
- satellite data
- complex data
- data mining techniques
- data sources
- search engine
- end users
- xml documents
- high dimensional
- spatial data
- experimental data
- statistical methods
- decision trees
- data mining
- semistructured data
- databases