Accelerating content-defined-chunking based data deduplication by exploiting parallelism.
Wen Xia
Dan Feng
Hong Jiang
Yucheng Zhang
Victor Chang
Xiangyu Zou
Published in:
Future Gener. Comput. Syst. (2019)
Keyphrases
data sets
raw data
training data
missing data
data analysis
synthetic data
image data
multimedia data
statistical analysis
small number
data sources
metadata
database
data points
data structure
original data
knowledge discovery
input data
data processing
data model
sensor data
high quality
data cleaning