BtrBlocks: Efficient Columnar Compression for Data Lakes.
Maximilian KuschewskiDavid SauerweinAdnan AlhomssiViktor LeisPublished in: Proc. ACM Manag. Data (2023)
Keyphrases
- data quality
- data sets
- database
- small number
- synthetic data
- data collection
- training data
- data processing
- knowledge discovery
- data structure
- prior knowledge
- high quality
- complex data
- raw data
- data sources
- image data
- input data
- computer systems
- sensor data
- historical data
- noisy data
- data compression
- data distribution
- experimental data
- multi dimensional
- data analysis
- database systems
- website
- data mining