Gated Multi-modal Fusion with Cross-modal Contrastive Learning for Video Question Answering.
Chenyang Lyu
Wenxi Li
Tianbo Ji
Liting Zhou
Cathal Gurrin
Published in:
ICANN (7) (2023)
Keyphrases
question answering
cross modal
multi modal fusion
information retrieval
information extraction
video data
visual recognition
computer vision
natural language
learning tasks
search engine
multi modal
text categorization
multimedia data
video analysis