Finding Themes in Medline Documents: Probabilistic Similarity Search.
Hagit ShatkayW. John WilburPublished in: ADL (2000)
Keyphrases
- similarity search
- vector space
- metric space
- distance function
- high dimensional
- multimedia databases
- similarity searching
- query processing
- efficient similarity search
- information retrieval
- indexing techniques
- high dimensional data
- similarity retrieval
- similarity measure
- relevant documents
- knn
- document collections
- nearest neighbor search
- hash functions
- similarity queries
- information retrieval systems
- r tree
- triangle inequality
- databases
- database
- uncertain data
- approximate similarity search
- content based retrieval
- query terms
- user queries
- locality sensitive hashing
- dimensionality reduction
- xml documents
- metadata
- neural network