Explicit Image Caption Reasoning: Generating Accurate and Informative Captions for Complex Scenes with LMM.
Mingzhang Cui, Caihong Li, Yi Yang
Published in: Sensors (2024)
Keyphrases
- complex scenes
- computer graphics
- image data
- visual features
- real world scenes
- photorealistic
- multiple objects
- multiscale
- image features
- single image
- image collections
- image classification
- image retrieval
- image content
- image search
- test images
- image segmentation
- virtual environment
- image representation
- high resolution
- high quality
- image processing
- lighting conditions
- input image
- object recognition
- complex background
- image synthesis
- video tracking