Data lake concept and systems: a survey.
Rihan HaiChristoph QuixMatthias JarkePublished in: CoRR (2021)
Keyphrases
- synthetic data
- data sets
- raw data
- statistical analysis
- data processing
- data analysis
- data points
- management system
- storage systems
- measured data
- historical data
- big data
- data quality
- original data
- database
- data distribution
- enormous amounts
- high dimensional data
- distributed systems
- small number
- data structure
- machine learning
- neural network
- prior knowledge
- training data
- decision trees
- data objects
- website
- multimedia
- complex data
- social networks
- information retrieval
- satellite data