Convolutional Two-Stream Network Fusion for Video Action Recognition.
Christoph FeichtenhoferAxel PinzAndrew ZissermanPublished in: CVPR (2016)
Keyphrases
- action recognition
- human actions
- action classification
- video dataset
- spatial temporal
- action detection
- recognizing human actions
- recognition of human actions
- motion features
- computer vision
- activity recognition
- bag of words
- human activities
- human detection
- body parts
- static images
- space time interest points
- video sequences
- bag of features
- data streams
- spatio temporal
- view invariant
- motion capture data
- recognizing actions
- depth sensors
- video data
- video frames
- video streams
- mid level
- human pose
- video shots
- wireless sensor networks
- video content
- video surveillance
- action recognition in videos
- video clips