Protein Domain Embeddings for Fast and Accurate Similarity Search.
Benjamin Giovanni IovinoHaixu TangYuzhen YePublished in: RECOMB (2024)
Keyphrases
- similarity search
- vector space
- high dimensional data
- mass spectra
- high dimensional
- metric space
- distance function
- query processing
- sequence databases
- multimedia databases
- efficient similarity search
- binary codes
- similarity measure
- low dimensional
- indexing techniques
- dynamic time warping
- similarity searching
- knn
- similarity queries
- r tree
- nearest neighbor
- efficient indexing
- cross view
- similarity retrieval
- protein structure
- dimensionality reduction
- hash functions
- nearest neighbor search
- locality sensitive hashing
- triangle inequality
- machine learning
- similarity search in metric spaces
- indexing structure
- amino acids
- high precision
- decision trees
- feature selection