Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition.
Syed Talal WasimMuhammad Uzair KhattakMuzammal NaseerSalman KhanMubarak ShahFahad Shahbaz KhanPublished in: ICCV (2023)
Keyphrases
- action recognition
- human actions
- spatial temporal
- spatio temporal
- action classification
- spatio temporal interest points
- video database
- recognition of human actions
- video dataset
- video data
- motion features
- action detection
- space time
- video sequences
- activity recognition
- bag of words
- spatial and temporal
- static images
- view invariant
- human detection
- multimedia
- action primitives
- human motion
- human activities
- video content
- mid level
- video streams
- video frames
- motion history images
- recognizing human actions
- space time interest points
- video surveillance
- visual features
- event recognition
- temporal structure
- video images
- human pose
- bag of features
- video shots
- video clips
- recognizing actions
- sensor data
- machine learning