Spatio-Temporal Self-Supervision Enhanced Transformer Networks for Action Recognition.
Yongkang ZhangHan ZhangGuoming WuJun LiPublished in: ICME (2022)
Keyphrases
- action recognition
- human actions
- spatio temporal
- spatial temporal
- action recognition in videos
- spatio temporal interest points
- view invariant
- recognition of human actions
- human detection
- action classification
- bag of words
- activity recognition
- body parts
- recognizing human actions
- recognizing actions
- spatial and temporal
- bag of features
- image sequences
- video dataset
- action detection
- independent subspace analysis
- mid level
- static images
- human pose
- human activities
- depth sensors
- action primitives
- depth cameras
- human motion
- image retrieval