Multi-level comparison of data deduplication in a backup scenario.
Dirk Meister, André Brinkmann
Published in: SYSTOR (2009)
Keyphrases
- data sets
- data processing
- prior knowledge
- raw data
- high quality
- statistical analysis
- database
- small number
- noisy data
- original data
- sensor data
- synthetic data
- data collection
- knowledge discovery
- neural network
- relational databases
- real world
- feature space
- data analysis
- bayesian networks
- domain experts
- data cleaning
- data objects
- information retrieval
- statistical methods
- temporal information
- experimental data
- information systems
- data points
- data structure
- input data
- data mining techniques
- image data
- xml documents