VioLET: Vision-Language Efficient Tuning with Collaborative Multi-modal Gradients.

Published in: ACM Multimedia (2023)

Keyphrases