DZIP: A Data Deduplication-Compatible Enhanced Version of Gzip.
Hengying XiaoYangyang LiuPublished in: AIS&P (1) (2023)
Keyphrases
- data collection
- data sets
- database
- complex data
- data quality
- statistical analysis
- data analysis
- data mining techniques
- real time
- knowledge discovery
- prior knowledge
- high quality
- training data
- information systems
- data points
- image data
- data sources
- synthetic data
- missing values
- raw data
- original data
- big data
- historical data
- data cleaning