Login / Signup
Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning.
Peng Jin
Jinfa Huang
Pengfei Xiong
Shangxuan Tian
Chang Liu
Xiangyang Ji
Li Yuan
Jie Chen
Published in:
CoRR (2023)
Keyphrases
</>
cross modal
perceptual information
learning algorithm
information retrieval
multimedia
video sequences
learning process
visual recognition
domain knowledge
multi modal
virtual environment
learning tasks
visual similarity