Video Pose Distillation for Few-Shot, Fine-Grained Sports Action Recognition.
James HongMatthew FisherMichaël GharbiKayvon FatahalianPublished in: CoRR (2021)
Keyphrases
- fine grained
- action recognition
- action detection
- human actions
- sports video
- human pose
- mid level
- recognizing actions
- video shots
- video dataset
- video database
- action classification
- spatial temporal
- video sequences
- coarse grained
- video data
- video content
- motion features
- key frames
- bag of words
- static images
- activity recognition
- recognition of human actions
- human detection
- recognizing human actions
- access control
- video streams
- human activities
- video analysis
- computer vision
- body parts
- view invariant
- news video
- space time interest points
- visual features
- event detection
- video clips
- motion history images
- pose estimation
- video indexing
- visual content
- object detection
- d objects
- atomic actions
- video retrieval
- video frames
- human body
- human motion
- markov random field
- data lineage
- high level