Efficient Sequential and Parallel Algorithms for Incremental Record Linkage Using Complete Linkage Clustering.
Abdullah Baihan, Sanguthevar Rajasekaran
Published in: BIBM (2019)
Keyphrases
- record linkage
- approximate matching
- duplicate detection
- data cleaning
- privacy preserving
- incremental clustering
- clustering method
- linked data
- entity resolution
- census data
- multiple databases
- clustering algorithm
- data points
- k-means
- self organizing maps
- data processing
- disclosure risk
- efficient incremental
- case study
- machine learning
- group membership