Quantized Random Projections and Non-Linear Estimation of Cosine Similarity.
Ping LiMichael MitzenmacherMartin SlawskiPublished in: NIPS (2016)
Keyphrases
- random projections
- cosine similarity
- document clustering
- dimensionality reduction
- dimension reduction
- distance measure
- similarity function
- tf idf
- vector space model
- random sampling
- similarity measure
- vector space
- k means
- semantic similarity
- image reconstruction
- sparse representation
- neural network
- text mining
- principal component analysis
- hash functions
- euclidean distance
- high dimensionality
- original data
- low dimensional
- multiscale
- learning algorithm
- machine learning
- text documents
- clustering method
- clustering algorithm