Few-shot Action Recognition with Captioning Foundation Models.
Xiang WangShiwei ZhangHangjie YuanYingya ZhangChangxin GaoDeli ZhaoNong SangPublished in: CoRR (2023)
Keyphrases
- action recognition
- static images
- computer vision
- action classification
- bag of words
- human actions
- activity recognition
- human detection
- spatial temporal
- probabilistic model
- recognition of human actions
- depth sensors
- human pose
- body parts
- video sequences
- random fields
- human activities
- pose estimation
- object detection
- low level
- feature vectors
- recognizing human actions