Scalable string similarity search/join with approximate seeds and multiple backtracking.
Enrico SiragusaDavid WeeseKnut ReinertPublished in: EDBT/ICDT Workshops (2013)
Keyphrases
- similarity search
- query processing
- similarity join
- distance function
- metric space
- high dimensional
- multimedia databases
- similarity measure
- distance computation
- efficient search
- similarity queries
- similarity searching
- efficient similarity search
- indexing techniques
- similarity retrieval
- hash functions
- high dimensional data
- edit distance
- cross view
- data sets
- approximate similarity search
- image data
- data structure
- pattern matching
- r tree
- join algorithms
- data mining
- locality sensitive hashing
- space partitioning
- triangle inequality
- databases
- feature selection
- image processing
- feature extraction
- search algorithm
- knn
- binary codes
- nearest neighbor search
- nearest neighbor