TSIC-CLIP: Traffic Scene Image Captioning Model Based on Clip.
Hao ZhangCheng XuBingxin XuMuwei JianHongzhe LiuXuewei LiPublished in: Inf. Technol. Control. (2024)
Keyphrases
- input image
- single image
- image features
- scene understanding
- image regions
- image representation
- image segmentation
- image retrieval
- scene images
- low level
- image based rendering
- complex scenes
- image content
- d scene
- geometric information
- low level features
- multiple objects
- reference images
- location and orientation
- multiscale
- real world scenes
- camera images
- image motion
- imaging process
- outdoor scenes
- relative position
- piecewise planar
- intensity images
- multiple images
- geometric constraints
- image set
- image data
- image classification
- visual data
- scene matching
- high resolution
- spatial relations
- scene geometry
- dense depth map
- video sequences
- moving objects
- image segments
- segmentation method
- cast shadows
- spatial information
- image matching
- key frames
- depth estimation
- video clips
- pixel values
- computer vision
- aerial images