Effectively Leveraging CLIP for Generating Situational Summaries of Images and Videos.
Dhruv VermaDebaditya RoyBasura FernandoPublished in: CoRR (2024)
Keyphrases
- image data
- input image
- ground truth
- image analysis
- image database
- digital photos
- multi view
- image regions
- test images
- rigid body
- edge detection
- lighting conditions
- image set
- image collections
- video images
- three dimensional
- image features
- image quality
- segmentation algorithm
- spatial information
- dynamic scenes
- video analysis