Cognitive resilience: Unraveling the proficiency of image-captioning models to interpret masked visual content.
Zhicheng Du, Zhaotian Xie, Huazhang Ying, Likun Zhang, Peiwu Qin
Published in: CoRR (2024)
Keyphrases
- visual content
- image content
- image collections
- image retrieval
- image representation
- content based image
- low level
- image features
- visual descriptors
- visual features
- visual appearance
- image classification
- image data
- input image
- spatial relationships
- textual and visual information
- image regions
- image matching
- visual concepts
- high resolution
- multiscale
- image database
- visual words
- relevance feedback
- probabilistic model
- video retrieval
- image processing
- textual and visual features
- high level
- spatial relations
- video content
- visual information
- co-occurrence