MinJoin++: a fast algorithm for string similarity joins under edit distance.
Nikolai KarpovHaoyu ZhangQin ZhangPublished in: VLDB J. (2024)
Keyphrases
- similarity join
- edit distance
- graph matching
- string matching
- edit operations
- levenshtein distance
- string similarity
- similarity measure
- distance computation
- string edit distance
- tree structured data
- distance function
- dynamic programming
- distance measure
- hamming distance
- metric space
- pattern matching
- similarity search
- feature vectors
- neural network