Convolutional Two-Stream Network Fusion for Video Action Recognition.
Christoph FeichtenhoferAxel PinzAndrew ZissermanPublished in: CoRR (2016)
Keyphrases
- action recognition
- human actions
- action classification
- spatial temporal
- video dataset
- recognizing human actions
- action detection
- recognition of human actions
- motion features
- static images
- activity recognition
- human detection
- bag of words
- human activities
- body parts
- space time interest points
- computer vision
- depth sensors
- recognizing actions
- data streams
- mid level
- video sequences
- motion capture data
- video streams
- space time
- spatio temporal
- human pose
- action primitives
- view invariant
- action recognition in videos
- machine learning
- d objects
- wireless sensor networks
- multimedia
- view invariant action recognition
- bag of features
- video analysis
- video clips
- video surveillance
- human motion
- video data