Efficient Processing of Hamming-Distance-Based Similarity-Search Queries Over MapReduce.
MingJie TangYongyang YuWalid G. ArefQutaibah M. MalluhiMourad OuzzaniPublished in: EDBT (2015)
Keyphrases
- document collections
- efficient processing
- search queries
- information retrieval systems
- distance measure
- relevant documents
- query terms
- range queries
- query processing
- similarity measure
- efficient implementation
- search engine
- web search
- web users
- query logs
- user queries
- query expansion
- distance function
- join algorithms
- web search engines
- database
- multi dimensional
- keyword search
- information access
- vector space
- information sources
- data streams
- keywords
- multimedia
- data mining
- keyword queries
- databases