Login / Signup
Dynamic Graph Representation Learning for Video Dialog via Multi-Modal Shuffled Transformers.
Shijie Geng
Peng Gao
Moitreya Chatterjee
Chiori Hori
Jonathan Le Roux
Yongfeng Zhang
Hongsheng Li
Anoop Cherian
Published in:
AAAI (2021)
Keyphrases
</>
multi modal
graph representation
semantic concepts
video search
audio visual
multi modality
metadata
multimedia
low level
mutual information
video data
multimedia data
video content
cross modal