Hashing-Based Distributed Clustering for Massive High-Dimensional Data.
Yifeng XiaoJiang XueDeyu MengPublished in: CoRR (2023)
Keyphrases
- high dimensional data
- data analysis
- high dimensionality
- similarity search
- nearest neighbor
- dimensionality reduction
- subspace clustering
- data points
- high dimensional
- low dimensional
- high dimensions
- data sets
- high dimensional datasets
- dimension reduction
- clustering high dimensional data
- sparse representation
- linear discriminant analysis
- nearest neighbor search
- input data
- high dimensional data sets
- random projections
- nonlinear dimensionality reduction
- knn
- lower dimensional
- hash functions
- input space
- manifold learning
- original data
- data distribution
- text data
- data structure
- high dimensional spaces
- feature selection
- high dimensional data analysis
- high dimensional feature spaces
- small sample size
- binary codes
- dimensional data
- cluster analysis
- image data
- training set
- pattern recognition
- training data
- neural network