Projection Based Large Scale High-Dimensional Data Similarity Join Using MapReduce Framework.
Youzhong MaRuiling ZhangZhanyou CuiChunjie LinPublished in: IEEE Access (2020)
Keyphrases
- high dimensional data
- similarity join
- mapreduce framework
- similarity search
- cloud computing
- data sets
- low dimensional
- large scale data sets
- high dimensional
- dimensionality reduction
- metric space
- nearest neighbor
- data analysis
- data points
- edit distance
- data distribution
- distance function
- structural similarity
- real world
- database
- databases
- input data
- hash functions
- query optimization
- bloom filter
- locality sensitive hashing
- feature extraction