Streaming Similarity Search over one Billion Tweets using Parallel Locality-Sensitive Hashing.
Narayanan Sundaram, Aizana Turmukhametova, Nadathur Satish, Todd Mostak, Piotr Indyk, Samuel Madden, Pradeep Dubey
Published in: Proc. VLDB Endow. (2013)
Keyphrases
- similarity search
- locality sensitive hashing
- distance function
- hash functions
- metric space
- nearest neighbor search
- approximate similarity search
- high dimensional
- knn
- query processing
- high dimensional data
- indexing techniques
- multimedia databases
- approximate nearest neighbor
- similarity measure
- approximate nearest neighbor search
- vector space
- hash table
- binary codes
- r tree
- similarity queries
- distance computation
- data streams
- kd tree
- machine learning
- brute force
- nearest neighbor