Attention-Based Multimodal Fusion for Video Description.
Chiori Hori
Takaaki Hori
Teng-Yok Lee
Ziming Zhang
Bret Harsham
John R. Hershey
Tim K. Marks
Kazuhiro Sumi
Published in:
ICCV (2017)
Keyphrases
multimodal fusion
high robustness
audio visual
video sequences
relevance feedback
multimedia
video content
high level
video data
gait recognition
video frames
space time
low level
multimodal interfaces
machine learning
multi modal
visual data