A Global Survey on Data Deduplication.
Shubhanshi SinghalPooja SharmaRajesh Kumar AggarwalVishal PassrichaPublished in: Int. J. Grid High Perform. Comput. (2018)
Keyphrases
- complex data
- data collection
- data sets
- synthetic data
- statistical analysis
- computer systems
- data analysis
- data sources
- database
- image data
- data quality
- data processing
- high quality
- historical data
- big data
- missing values
- application domains
- high dimensional data
- training data
- input data
- data mining techniques
- knowledge discovery
- database systems
- spatial data
- experimental data
- raw data
- noisy data
- prior knowledge