Login / Signup

ProPD: Dynamic Token Tree Pruning and Generation for LLM Parallel Decoding.

Shuzhang ZhongZebin YangMeng LiRuihao GongRunsheng WangRu Huang
Published in: CoRR (2024)
Keyphrases
  • tree pruning
  • dynamic environments
  • computer architecture
  • parallel implementation
  • database
  • computer vision
  • decision making
  • web services
  • image quality