Spatio-Temporal Self-Supervision Enhanced Transformer Networks for Action Recognition.

Yongkang Zhang Han Zhang Guoming Wu Jun Li

Published in: ICME (2022)

Keyphrases

action recognition
human actions
spatio temporal
spatial temporal
action recognition in videos
spatio temporal interest points
view invariant
recognition of human actions
human detection
action classification
bag of words
activity recognition
body parts
recognizing human actions
recognizing actions
spatial and temporal
bag of features
image sequences
video dataset
action detection
independent subspace analysis
mid level
static images
human pose
human activities
depth sensors
action primitives
depth cameras
human motion
image retrieval