Temporal-enhanced Cross-modality Fusion Network for Video Sentence Grounding.
Zezhong Lv, Bing Su
Published in: ICME (2023)
Keyphrases
- spatial and temporal
- temporal information
- network conditions
- temporal correlation
- peer to peer
- real time
- temporal consistency
- temporal structure
- temporal constraints
- video content
- multimedia
- video data
- spatio temporal
- space time
- video sequences
- data fusion
- temporal coherence
- network structure
- spatial temporal
- wireless sensor networks
- natural language
- video clips
- temporal domain
- computer networks
- temporal order
- video streams
- video frames
- temporal resolution
- video delivery
- multi modal fusion
- temporal analysis
- temporal relationships
- network resources
- temporal data
- temporal patterns
- network model
- complex networks
- image sequences
- neural network