streamingRPHash: Random Projection Clustering of High-Dimensional Data in a MapReduce Framework.
Jacob Franklin, Samuel Wenke, Sadiq Quasem, Lee A. Carraher, Philip A. Wilsey
Published in: CLUSTER (2016)
Keyphrases
- random projections
- mapreduce framework
- cloud computing
- large scale data sets
- dimensionality reduction
- dimension reduction
- original data
- sparse representation
- image reconstruction
- frequent itemset mining
- principal component analysis
- low dimensional
- random sampling
- data management
- document clustering
- high dimensionality
- pattern recognition
- high dimensional
- data sets
- hash functions
- information retrieval
- neural network
- databases