FastQuery: Communication-efficient Embedding Table Query for Private LLM Inference.

Chenqi Lin, Tianshi Xu, Zebin Yang, Runsheng Wang, Ru Huang, Meng Li
Published in: CoRR (2024)
Keyphrases