dSCAM: Finding Document Copies Across Multiple Databases.
Hector Garcia-MolinaLuis GravanoNarayanan ShivakumarPublished in: PDIS (1996)
Keyphrases
- multiple databases
- databases
- information retrieval
- record linkage
- web documents
- document images
- keywords
- retrieval systems
- knowledge discovery in databases
- multiple data sources
- information retrieval systems
- document collections
- document clustering
- decision trees
- expert systems
- association rules
- relational databases
- database