AE: An Asymmetric Extremum content defined chunking algorithm for fast and bandwidth-efficient data deduplication.
Yucheng ZhangHong JiangDan FengWen XiaMin FuFangting HuangYukun ZhouPublished in: INFOCOM (2015)
Keyphrases
- input data
- data sets
- learning algorithm
- single pass
- database
- training data
- noisy data
- computationally efficient
- cost function
- network bandwidth
- data reduction
- data structure
- data analysis
- probabilistic model
- simulated annealing
- information loss
- k means
- data quality
- computational complexity
- dynamic programming
- worst case
- preprocessing
- spectral clustering
- segmentation algorithm
- similarity measure
- objective function
- spatial data
- synthetic data
- detection algorithm
- data processing
- information extraction
- data sources
- search space