Login / Signup
LSTP: Language-guided Spatial-Temporal Prompt Learning for Long-form Video-Text Understanding.
Yuxuan Wang
Yueqian Wang
Pengfei Wu
Jianxin Liang
Dongyan Zhao
Zilong Zheng
Published in:
CoRR (2024)
Keyphrases
</>
spatial temporal
text understanding
video shots
natural language processing
action recognition
computer vision
knowledge discovery
spatial and temporal
computational linguistics