Login / Signup

LSTP: Language-guided Spatial-Temporal Prompt Learning for Long-form Video-Text Understanding.

Yuxuan WangYueqian WangPengfei WuJianxin LiangDongyan ZhaoZilong Zheng
Published in: CoRR (2024)
Keyphrases
  • spatial temporal
  • text understanding
  • video shots
  • natural language processing
  • action recognition
  • computer vision
  • knowledge discovery
  • spatial and temporal
  • computational linguistics