Cross-modal attention guided visual reasoning for referring image segmentation.
Wenjing ZhangMengnan HuQuange TanQianli ZhouRong WangPublished in: Multim. Tools Appl. (2023)
Keyphrases
- cross modal
- image segmentation
- perceptual information
- multi modal
- multimedia retrieval
- image retrieval
- knowledge base
- graph cuts
- multimedia databases
- visual data
- visual recognition
- visual similarity
- image processing
- markov random field
- computer vision
- image understanding
- image data
- visual features
- multi label
- feature extraction