Video-GroundingDINO: Towards Open-Vocabulary Spatio-Temporal Video Grounding.
Syed Talal Wasim, Muzammal Naseer, Salman Khan, Ming-Hsuan Yang, Fahad Shahbaz Khan
Published in: CoRR (2024)
Keyphrases
- spatio temporal
- spatial and temporal
- video data
- space time
- spatial temporal
- video content
- real time
- video streams
- video frames
- multimedia
- video clips
- video analysis
- video sequences
- video database
- video retrieval
- real time video
- key frames
- video representation
- video segmentation
- online video
- visual vocabulary
- video copy detection
- surveillance videos
- dynamic textures
- video shots
- event detection
- image sequences
- computer vision
- search engine