Optimal partitioning of data chunks in deduplication systems.
Michael HirschAriel Ish-ShalomShmuel T. KleinPublished in: Discret. Appl. Math. (2016)
Keyphrases
- data sets
- high quality
- raw data
- training data
- computer systems
- data sources
- end users
- image data
- data collection
- data structure
- probability distribution
- complex data
- data distribution
- sensor data
- synthetic data
- statistical analysis
- data processing
- data mining techniques
- small number
- data points
- decision trees
- multimedia
- neural network