Audio-Visual Instance Segmentation.
Ruohao GuoYaru ChenYanyu QiWenzhen YueDantong NiuXianghua YingPublished in: CoRR (2023)
Keyphrases
- audio visual
- temporal segmentation
- multi modal
- visual information
- visual data
- person authentication
- image segmentation
- temporal context
- video summarization
- multi stream
- audio visual speech recognition
- multimedia
- image regions
- multiscale
- emotion recognition
- image content
- spatio temporal
- high level
- three dimensional
- image processing