Dual-Level Decoupled Transformer for Video Captioning.
Yiqi Gao, Xinglin Hou, Wei Suo, Mengyang Sun, Tiezheng Ge, Yuning Jiang, Peng Wang
Published in: CoRR (2022)
Keyphrases
- video sequences
- video content
- multimedia
- video data
- video analysis
- space time
- levels of abstraction
- key frames
- digital video
- video surveillance
- video frames
- power system
- spatial and temporal
- real time
- video streams
- video retrieval
- fault diagnosis
- fuzzy logic
- surveillance videos
- video processing
- video images
- online video