Login / Signup
Rethinking Multi-Modal Alignment in Multi-Choice VideoQA from Feature and Sample Perspectives.
Shaoning Xiao
Long Chen
Kaifeng Gao
Zhao Wang
Yi Yang
Zhimeng Zhang
Jun Xiao
Published in:
EMNLP (2022)
Keyphrases
</>
multi modal
multi modality
audio visual
image annotation
humanoid robot
high dimensional
cross modal
single modality
high level
feature vectors
semantic concepts