Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models.

Published in: CVPR (2023)

Keyphrases