Login / Signup

Pruning One More Token is Enough: Leveraging Latency-Workload Non-Linearities for Vision Transformers on the Edge.

Nick John EliopoulosPurvish JajalJames C. DavisGaowen LiuGeorge K. ThiravathukalYung-Hsiang Lu
Published in: CoRR (2024)
Keyphrases