FastQuery: Communication-efficient Embedding Table Query for Private LLM Inference.
Chenqi LinTianshi XuZebin YangRunsheng WangRu HuangMeng LiPublished in: CoRR (2024)
Keyphrases
- database
- query processing
- vector space
- communication cost
- query evaluation
- data sources
- user interaction
- response time
- xml query processing
- efficient learning
- indexing structure
- data retrieval
- communication systems
- communication networks
- web search
- indexing techniques
- multi dimensional
- complex queries
- query formulation
- query language
- data structure
- keywords