Login / Signup

Compress, Then Prompt: Improving Accuracy-Efficiency Trade-off of LLM Inference with Transferable Prompt.

Zhaozhuo XuZirui LiuBeidi ChenYuxin TangJue WangKaixiong ZhouXia HuAnshumali Shrivastava
Published in: CoRR (2023)
Keyphrases