Dataset Similarity Detection for Global Deduplication in the DD File System.
Tony Wong, Smriti Thakkar, Kao-Feng Hsieh, Zachary Tom, Hetaben Saraiya, Philip Shilane
Published in: ICDE (2023)
Keyphrases
- file system
- detection algorithm
- object detection
- detection method
- data transfer
- similarity measure
- continuous media
- storage devices
- naming conventions
- application specific
- false positives
- distance function
- face detection
- distance measure
- benchmark datasets
- access patterns
- object detectors
- anomaly detection
- metadata management
- general purpose
- flash memory
- storage systems
- database