TSCF: An Efficient Two-Stage Cuckoo Filter for Data Deduplication.
Tao LiuQinshu ChenHui LiBohui WangXin YangPublished in: MSN (2021)
Keyphrases
- data sets
- complex data
- raw data
- image data
- synthetic data
- data collection
- data analysis
- data quality
- experimental data
- high quality
- data structure
- neural network
- data points
- knowledge discovery
- database
- training data
- data cleaning
- data mining
- domain experts
- statistical analysis
- data processing
- input data
- data mining techniques
- probability distribution
- data sources