TEAM-Net: Multi-modal Learning for Video Action Recognition with Partial Decoding.
Zhengwei WangQi SheAljosa SmolicPublished in: CoRR (2021)
Keyphrases
- multi modal
- action recognition
- spatial temporal
- human actions
- semantic concepts
- action classification
- video search
- action detection
- bag of words
- video dataset
- activity recognition
- static images
- video sequences
- high level
- recognizing actions
- multi modality
- multiple modalities
- cross modal
- multimedia
- video database
- multimedia data
- video data
- audio visual
- video frames
- medical images
- view invariant
- spatio temporal
- high dimensional