SBBS: A sliding blocking algorithm with backtracking sub-blocks for duplicate data detection.
GuiPing WangShuyu ChenMingwei LinXiaowei LiuPublished in: Expert Syst. Appl. (2014)
Keyphrases
- detection algorithm
- input data
- data analysis
- data sets
- detection method
- cost function
- data reduction
- data collection
- noisy data
- dynamic programming
- database
- np hard
- search space
- optimal solution
- computational cost
- data quality
- matching algorithm
- high dimensional data
- data structure
- recognition algorithm
- data mining techniques
- original data
- information loss
- segmentation algorithm
- particle swarm optimization
- knowledge discovery
- k means
- training data
- learning algorithm
- expectation maximization
- simulated annealing
- worst case
- data distribution
- data points
- data sources
- neural network
- synthetic datasets
- equal sized