Design Tradeoffs for Data Deduplication Performance in Backup Workloads.
Min Fu, Dan Feng, Yu Hua, Xubin He, Zuoning Chen, Wen Xia, Yucheng Zhang, Yujuan Tan
Published in: FAST (2015)
Keyphrases
- data sets
- synthetic data
- data processing
- data points
- computer systems
- image data
- training data
- original data
- statistical analysis
- complex data
- knowledge discovery
- data structure
- raw data
- database
- data collection
- databases
- record linkage
- data integrity
- statistical methods
- data distribution
- input data
- end users
- data sources
- user interface
- high quality