Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning.

Published in: CoRR (2023)

Keyphrases