Semantic Embedding Guided Attention with Explicit Visual Feature Fusion for Video Captioning.
Shanshan Dong
Tian-Zi Niu
Xin Luo
Wu Liu
Xinshun Xu
Published in:
ACM Trans. Multim. Comput. Commun. Appl. (2023)
Keyphrases
feature fusion
multiple features
feature extraction
video data
video sequences
visual information
high level
semantic information
video frames
data sets
visual features
low level
fusion algorithm
key frames
low level features
multi sensor
face recognition
fusion method
data mining