TempCompass: Do Video LLMs Really Understand Videos?
Yuanxin LiuShicheng LiYi LiuYuxiang WangShuhuai RenLei LiSishuo ChenXu SunLu HouPublished in: CoRR (2024)
Keyphrases
- video content
- video frames
- video sequences
- video data
- video analysis
- video database
- video clips
- key frames
- video indexing
- video editing
- event recognition
- online video
- youtube videos
- video streams
- video dataset
- temporal coherence
- input video
- video representation
- spatiotemporal features
- content based copy detection
- video event
- moving camera
- video material
- video retrieval
- space time
- video annotation
- video surveillance
- high definition
- video browsing
- video shots
- video search
- video images
- video segments
- instructional videos
- human activities
- semantic concept detection
- stereoscopic video
- dynamic scenes
- video summarization
- video classification
- temporal domain
- successive frames
- video sharing
- video stabilization
- natural language descriptions
- stationary camera
- foreground background segmentation
- visual analysis
- web videos
- video collections
- multimedia
- human actions
- static images
- motion features
- sports video
- video objects
- news video
- spatio temporal
- motion estimation
- video copy detection
- video scene
- event detection
- action detection
- video signals
- multimedia data
- camera motion
- surveillance videos
- action recognition