Columnar Storage Optimization and Caching for Data Lakes.
Guodong JinHaoqiong BianYueguo ChenXiaoyong DuPublished in: EDBT (2022)
Keyphrases
- data analysis
- data sets
- database
- data collection
- data distribution
- data processing
- data transfer
- data sources
- original data
- statistical analysis
- data points
- data quality
- optimization algorithm
- satellite data
- data storage
- high quality
- missing data
- synthetic data
- computer systems
- data structure
- input data
- data mining techniques
- response time
- image data