Grounding-Prompter: Prompting LLM with Multimodal Information for Temporal Sentence Grounding in Long Videos.
Houlun Chen
Xin Wang
Hong Chen
Zihan Song
Jia Jia
Wenwu Zhu
Published in:
CoRR (2023)
Keyphrases
multimodal information
video data
visual data
video sequences
spatio-temporal
space time
information retrieval
object recognition
domain knowledge
low level
video content