V-SMART-Join: A Scalable MapReduce Framework for All-Pair Similarity Joins of Multisets and Vectors
Ahmed Metwally, Christos Faloutsos
Published in: CoRR (2012)
Keyphrases
- similarity join
- mapreduce framework
- cloud computing
- join algorithms
- metric space
- similarity search
- edit distance
- large scale data sets
- vector space
- join operations
- bloom filter
- structural similarity
- xml data
- similar objects
- uncertain data
- pairwise
- cost model
- distance computation
- feature vectors
- high dimensional
- data management
- similarity measure
- data model
- locality sensitive hashing
- xml databases
- data sets
- b tree
- main memory
- dynamic programming