SPDO: High-throughput road distance computations on Spark using Distance Oracles.
Shangfu Peng, Jagan Sankaranarayanan, Hanan Samet
Published in: ICDE (2016)
Keyphrases
- high throughput
- distance computation
- distance function
- microarray
- genome wide
- biological data
- similarity measure
- similarity search
- k nearest neighbor
- nearest neighbor
- euclidean distance
- edit distance
- multi step
- complex objects
- dimensionality reduction
- data acquisition
- locality sensitive hashing
- similarity queries
- road network
- knn
- proteomic data
- nearest neighbor search
- gene expression
- microarray data
- feature construction
- mass spectrometry
- query processing
- machine learning