Data Lakes: A Survey of Functions and Systems (Extended abstract).
Rihan Hai, Christos Koutras, Christoph Quix, Matthias Jarke
Published in: ICDE (2024)
Keyphrases
- extended abstract
- data sets
- computer systems
- data collection
- data analysis
- database
- data structure
- original data
- image data
- domain experts
- synthetic data
- statistical analysis
- data distribution
- knowledge discovery
- data points
- data sources
- prior knowledge
- high quality
- storage systems
- raw data
- data quality
- big data
- complex data
- commercial systems
- data processing
- distributed systems
- dimensionality reduction
- training data
- database systems
- decision trees
- learning algorithm
- neural network
- databases