SparkNN: A Distributed In-Memory Data Partitioning for KNN Queries on Big Spatial Data.
Zaher Al AghbariTasneem IsmailIbrahim KamelPublished in: Data Sci. J. (2020)
Keyphrases
- knn
- data partitioning
- spatial queries
- spatial data
- k nearest neighbor
- range queries
- similarity search
- query processing
- spatial databases
- query execution
- r tree
- nearest neighbor
- spatial database systems
- efficient processing
- on line analytical processing
- spatial objects
- indexing techniques
- distributed systems
- highly scalable
- distance function
- similarity queries
- index structure
- main memory
- metric space
- hierarchical clustering
- peer to peer
- spatial relationships
- cost model
- distributed environment
- text classification
- support vector machine
- data analysis
- location based services
- machine learning
- continuous queries
- response time
- query language