An efficient similarity join approach on large-scale high-dimensional data using random projection.
Youzhong Ma, Ruiling Zhang, Shijie Jia, Yongxin Zhang, Xiaofeng Meng
Published in: Concurr. Comput. Pract. Exp. (2019)
Keyphrases
- high dimensional data
- random projections
- dimensionality reduction
- low dimensional
- dimension reduction
- similarity join
- original data
- similarity search
- high dimensionality
- sparse representation
- high dimensional
- nearest neighbor
- principal component analysis
- data points
- data sets
- distance computation
- hash functions
- pattern recognition
- linear discriminant analysis
- metric space
- data distribution
- data analysis
- feature space
- distance function
- random sampling
- feature extraction
- feature selection
- image reconstruction
- microarray