Insights into Object Semantics: Leveraging Transformer Networks for Advanced Image Captioning.
Deema Abdal HafethStefanos D. KolliasPublished in: Sensors (2024)
Keyphrases
- region of interest
- image data
- multiscale
- single image
- image content
- image classification
- bounding box
- image analysis
- image representation
- input image
- image features
- lighting conditions
- spatial relationships
- image regions
- complex scenes
- pixel level
- high resolution
- object models
- multiple objects
- similar objects
- low level
- segmentation algorithm
- background clutter
- object localization
- position and orientation
- partial occlusion
- three dimensional objects
- spatial relations
- image segments
- keypoints
- test images
- image retrieval
- color images
- d objects
- neural network
- image set
- fault diagnosis
- target object
- semantic information
- edge detection
- image matching
- moving objects
- image sequences
- image segmentation
- image processing
- individual objects
- computer vision