Login / Signup
Cascade Cross-modal Attention Network for Video Actor and Action Segmentation from a Sentence.
Weidong Chen
Guorong Li
Xinfeng Zhang
Hongyang Yu
Shuhui Wang
Qingming Huang
Published in:
ACM Multimedia (2021)
Keyphrases
</>
cross modal
perceptual information
multi modal
visual data
human actions
video data
video streams
multimedia
image segmentation
video retrieval
video content
space time
video analysis
video frames
computer vision
video sequences
high level
image retrieval
feature extraction
visual recognition
metadata