Co-Grounding Networks With Semantic Attention for Referring Expression Comprehension in Videos.
Sijie Song, Xudong Lin, Jiaying Liu, Zongming Guo, Shih-Fu Chang
Published in: CVPR (2021)
Keyphrases
- video event
- natural language
- social networks
- video clips
- sports video
- semantic annotation
- semantic knowledge
- semantic video retrieval
- video database
- user generated
- semantic similarity
- human activities
- network structure
- video data
- video sequences
- semantic search
- low level features
- cognitive processes
- semantic concepts
- video analysis
- video content
- computer networks
- semantic information
- high level
- computer vision