iDedup: latency-aware, inline data deduplication for primary storage.
Kiran SrinivasanTimothy BissonGarth R. GoodsonKaladhar VorugantiPublished in: FAST (2012)
Keyphrases
- data sets
- data collection
- data analysis
- data structure
- database
- raw data
- data processing
- complex data
- synthetic data
- input data
- high quality
- training data
- image data
- small number
- knowledge discovery
- data transfer
- computer systems
- data mining
- bandwidth consumption
- data cleaning
- data retrieval
- storage space
- data objects
- original data
- experimental data
- sensor data
- statistical analysis
- association rules
- data sources
- data points
- high speed