Optimal Partitioning of Data Chunks in Deduplication Systems.
Michael HirschAriel Ish-ShalomShmuel Tomi KleinPublished in: Stringology (2013)
Keyphrases
- data sets
- data collection
- data analysis
- complex data
- data processing
- search engine
- high quality
- data quality
- computer systems
- original data
- input data
- missing values
- enormous amounts
- multimedia data
- raw data
- data objects
- storage systems
- spatial data
- missing data
- small number
- data points
- data sources
- wireless sensor networks
- xml documents
- expert systems
- social networks
- databases