Talisman: a JavaScript archive of fuzzy matching, information retrieval and record linkage building blocks.
Guillaume Plique
Published in: J. Open Source Softw. (2020)
Keyphrases
- metadata
- building blocks
- record linkage
- approximate matching
- information retrieval
- entity resolution
- duplicate detection
- privacy preserving
- fuzzy sets
- candidate matches
- data cleaning
- record pairs
- matching algorithm
- text mining
- multiple databases
- databases
- disclosure risk
- fuzzy logic
- pattern matching
- fuzzy rules
- website
- information retrieval systems
- data sets
- search engine
- information extraction
- data warehouse
- web applications
- open source
- artificial intelligence
- census data
- graph matching
- software components
- database systems
- case study
- data integration
- feature points
- group membership