DualFormer: Local-Global Stratified Transformer for Efficient Video Recognition.
Yuxuan Liang, Pan Zhou, Roger Zimmermann, Shuicheng Yan
Published in: CoRR (2021)
Keyphrases
- recognition rate
- multimedia
- object recognition
- recognition accuracy
- human activities
- real time
- video sequences
- video data
- fuzzy logic
- action recognition
- video streams
- pattern recognition
- digital video
- character recognition
- image recognition
- video database
- real time video
- text detection
- automatic recognition
- video analysis
- recognition algorithm
- human actions
- spatial and temporal
- space time