Efficient Prompt Caching via Embedding Similarity.
Hanlin Zhu
Banghua Zhu
Jiantao Jiao
Published in:
CoRR (2024)
Keyphrases
similarity measure
query processing
neural network
computationally expensive
prefetching
image segmentation
similarity metric
genetic algorithm
database systems
cost effective
data access