ViLT-CLIP: Video and Language Tuning CLIP with Multimodal Prompt Learning and Scenario-Guided Optimization.

Hao Wang, Fang Liu, Licheng Jiao, Jiahao Wang, Zehua Hao, Shuo Li, Lingling Li, Puhua Chen, Xu Liu
Published in: AAAI (2024)
Keyphrases