Spatial-Temporal Contextual Feature Fusion Network for Movie Description.

Yihui Liao Lu Fan Huiming Ding Zhifeng Xie

Published in: CICAI (1) (2022)

Keyphrases

spatial temporal
feature fusion
action recognition
spatial and temporal
spatio temporal
temporal information
human actions
video shots
high level
feature extraction
contextual information
knn
keypoints