Decomposed Cross-modal Distillation for RGB-based Temporal Action Detection.
Pilhyeon LeeTaeoh KimMinho ShimDongyoon WeeHyeran ByunPublished in: CoRR (2023)
Keyphrases
- cross modal
- action detection
- multi modal
- action recognition
- atomic actions
- multimedia retrieval
- temporal information
- spatio temporal
- action classification
- object detection
- image retrieval
- multimedia databases
- visual data
- color space
- space time
- human actions
- temporal relations
- temporal structure
- human activities
- visual similarity
- low level