Characterizing datasets for data deduplication in backup applications.
Nohhyun Park, David J. Lilja
Published in: IISWC (2010)
Keyphrases
- raw data
- data sets
- data processing
- database
- image data
- experimental data
- synthetic data
- high quality
- data analysis
- knowledge discovery
- data collection
- original data
- prior knowledge
- data mining tasks
- data structure
- small number
- spatial data
- massive data
- experimental conditions
- data quality
- neural network
- search engine
- privacy preserving
- database systems
- data points
- end users
- data sources