FastCDC: a Fast and Efficient Content-Defined Chunking Approach for Data Deduplication.
Wen XiaYukun ZhouHong JiangDan FengYu HuaYuchong HuQing LiuYucheng ZhangPublished in: USENIX Annual Technical Conference (2016)
Keyphrases
- data sets
- original data
- data analysis
- synthetic data
- data collection
- user defined
- database
- end users
- probability distribution
- high quality
- raw data
- experimental data
- missing data
- statistical analysis
- data processing
- image data
- mobile devices
- data sources
- digital libraries
- multimedia data
- semi supervised
- complex data
- data retrieval
- data records