High-performance probabilistic record linkage via multi-dimensional homomorphisms.
Ari RaschRichard SchulzeWaldemar GorusJan HillerSebastian BartholomäusSergei GorlatchPublished in: SAC (2019)
Keyphrases
- record linkage
- multi dimensional
- data cleaning
- duplicate detection
- entity resolution
- multiple databases
- privacy preserving
- linked data
- probabilistic model
- high dimensional
- bayesian networks
- generative model
- uncertain data
- census data
- group membership
- approximate matching
- machine learning
- probabilistic databases
- knowledge discovery
- e learning