Vector Embeddings by Sequence Similarity and Context for Improved Compression, Similarity Search, Clustering, Organization, and Manipulation of cDNA Libraries.
Daniel H. UmDavid A. KnowlesGail E. KaiserPublished in: CoRR (2023)
Keyphrases
- similarity search
- vector space
- high dimensional data
- streaming time series
- sequence similarity
- high dimensional
- similarity measure
- query processing
- metric space
- low dimensional
- distance function
- knn
- dimensionality reduction
- similarity searching
- microarray data
- indexing techniques
- binary codes
- similarity queries
- distance calculation
- data points
- hash functions
- nearest neighbor
- distance computation
- computational methods
- similarity computation
- pairwise
- pattern recognition
- high level
- decision trees