An Empirical Study of End-to-End Video-Language Transformers with Masked Visual Modeling.
Tsu-Jui Fu, Linjie Li, Zhe Gan, Kevin Lin, William Yang Wang, Lijuan Wang, Zicheng Liu
Published in: CVPR (2023)
Keyphrases
- end to end
- scalable video
- wireless ad hoc networks
- admission control
- ad hoc networks
- visual information
- video sequences
- high bandwidth
- video data
- congestion control
- multimedia
- internet protocol
- digital video
- multipath
- video frames
- visual features
- content delivery
- video content
- video streams
- real time
- transport layer
- multiple description coding