An Overview on Data Deduplication Techniques.
Xuecheng ZhangMingzhu DengPublished in: ITITS (2) (2015)
Keyphrases
- data sets
- small number
- image data
- complex data
- raw data
- database
- data sources
- data mining techniques
- high quality
- statistical analysis
- data analysis
- data collection
- missing data
- application domains
- experimental data
- data quality
- record linkage
- prior knowledge
- real time
- data mining
- data structure
- data points
- association rules
- data streams
- synthetic data
- input data
- training data
- case study
- data distribution
- data acquisition
- decision trees
- original data
- information systems