Contrastive Positive Sample Propagation along the Audio-Visual Event Line.
Jinxing ZhouDan GuoMeng WangPublished in: CoRR (2022)
Keyphrases
- audio visual
- multi modal
- visual information
- visual data
- video summarization
- person authentication
- temporal context
- emotion recognition
- multimedia
- audio visual speech recognition
- multi stream
- event detection
- multimedia data
- pose estimation
- visual features
- data processing
- information extraction
- low level
- image retrieval
- data analysis