Login / Signup
MMAD: Multi-modal Movie Audio Description.
Xiaojun Ye
Junhao Chen
Xiang Li
Haidong Xin
Chao Li
Sheng Zhou
Jiajun Bu
Published in:
LREC/COLING (2024)
Keyphrases
</>
multi modal
audio visual
cross modal
single modality
multimedia
multi modality
high dimensional
semantic concepts
video search
high level
image annotation
audio features
visual information
fusing multiple
image processing
humanoid robot
face recognition