Multiview Transformers for Video Recognition.
Shen YanXuehan XiongAnurag ArnabZhichao LuMi ZhangChen SunCordelia SchmidPublished in: CVPR (2022)
Keyphrases
- recognition rate
- human activities
- video data
- recognition accuracy
- object recognition
- video sequences
- real time video
- pattern recognition
- video frames
- real time
- digital video
- automatic recognition
- depth video
- activity detection
- recognition algorithm
- visual recognition
- video streams
- multimedia data
- activity recognition
- feature extraction
- multimedia
- video content
- multiple views
- video retrieval
- video clips
- space time
- dynamic scenes
- static images
- multi view
- video based face recognition
- text detection
- handwritten characters
- recognition process
- video database
- video analysis
- hand gestures
- image recognition
- character recognition
- partial occlusion
- motion estimation
- image retrieval
- three dimensional