A Scalable Similarity Join Algorithm Based on MapReduce and LSH.
Sébastien Rivault, Mostafa Bamha, Sébastien Limet, Sophie Robert
Published in: Int. J. Parallel Program. (2022)
Keyphrases
- join algorithms
- map reduce
- similarity join
- join processing
- database query processing
- join operations
- similarity measure
- query processing
- xml databases
- main memory
- cost model
- xml queries
- b tree
- efficient processing
- locality sensitive hashing
- cloud computing
- distance function
- data intensive
- distance computation
- databases
- nearest neighbor
- parallel processing
- similarity search
- distance measure
- data sets
- query optimization
- data skew
- hash join