Sequence of Hashes Compression in Data De-duplication.
Subashini BalachandranCornel ConstantinescuPublished in: DCC (2008)
Keyphrases
- data sets
- data collection
- database
- input data
- training data
- data processing
- original data
- data sources
- sequential data
- data reduction
- experimental data
- high dimensional data
- statistical analysis
- image data
- prior knowledge
- data analysis
- video sequences
- databases
- knowledge discovery
- data points
- probability distribution
- image compression
- xml documents
- synthetic data
- sensor data
- data structure
- statistical methods
- high quality
- feature selection
- random projections