Open-Vocabulary Audio-Visual Semantic Segmentation.
Ruohao Guo, Liao Qu, Dantong Niu, Yanyu Qi, Wenzhen Yue, Ji Shi, Bowei Xing, Xianghua Ying
Published in: CoRR (2024)
Keyphrases
- audio visual
- semantic segmentation
- multi modal
- superpixels
- conditional random fields
- visual information
- weakly supervised
- scene classification
- visual data
- object categories
- object classes
- multimedia
- keywords
- principal component analysis
- high dimensional
- contextual information
- information retrieval
- image set
- multiscale
- three dimensional