Spatial-Temporal Contextual Feature Fusion Network for Movie Description.

Yihui Liao, Lu Fan, Huiming Ding, Zhifeng Xie
Published in: CICAI (1) (2022)
Keyphrases
  • spatial temporal
  • feature fusion
  • action recognition
  • spatial and temporal
  • spatio temporal
  • temporal information
  • human actions
  • video shots
  • high level
  • feature extraction
  • contextual information
  • knn
  • keypoints