ViViT: A Video Vision Transformer.
Anurag ArnabMostafa DehghaniGeorg HeigoldChen SunMario LucicCordelia SchmidPublished in: ICCV (2021)
Keyphrases
- video data
- real time
- computer vision
- video sequences
- vision system
- video streams
- video frames
- real time video
- multimedia
- video analysis
- video content
- fault diagnosis
- key frames
- online video
- video segmentation
- video database
- spatial and temporal
- fuzzy logic
- multimedia data
- event detection
- space time
- moving objects
- search engine
- digital video
- visual perception
- neural network
- data sets