Video2Commonsense: Generating Commonsense Descriptions to Enrich Video Captioning.
Zhiyuan Fang, Tejas Gokhale, Pratyay Banerjee, Chitta Baral, Yezhou Yang
Published in: CoRR (2020)
Keyphrases
- video data
- video streams
- natural language descriptions
- video content
- video sequences
- real time
- spatial temporal
- video frames
- online video
- video images
- space time
- multimedia
- video surveillance
- video analysis
- digital video
- data sets
- knowledge base
- spatial and temporal
- human activities
- computer vision
- video database
- video clips
- video summarization
- surveillance videos
- compressed video
- video retrieval
- real time video
- database
- video segmentation
- high level
- event detection
- action recognition
- website
- case study
- multi agent