Login / Signup
MADTP: Multimodal Alignment-Guided Dynamic Token Pruning for Accelerating Vision-Language Transformer.
Jianjian Cao
Peng Ye
Shengze Li
Chong Yu
Yansong Tang
Jiwen Lu
Tao Chen
Published in:
CoRR (2024)
Keyphrases
</>
computer vision
vision system
real time
multi modal
dynamic environments
fault diagnosis
artificial intelligence
image processing
search space
fuzzy logic
programming language
language learning
natural language
ontology matching