Scalable Block Scheduling for Efficient Multi-database Record Linkage.
Thilina RanbadugeDinusha VatsalanPeter ChristenPublished in: ICDM (2016)
Keyphrases
- record linkage
- database
- multiple databases
- data cleaning
- databases
- data model
- duplicate detection
- highly scalable
- lightweight
- approximate matching
- entity resolution
- relational databases
- privacy preserving
- resource allocation
- database management systems
- database applications
- linked data
- data management
- information extraction
- query language
- decision making