Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images.
Nitzan Bitton GuettaYonatan BittonJack HesselLudwig SchmidtYuval EloviciGabriel StanovskyRoy SchwartzPublished in: CoRR (2023)
Keyphrases
- image database
- ground truth
- three dimensional
- image registration
- image understanding
- input image
- image data
- image classification
- original images
- rigid body
- quantitative evaluation
- real time
- image analysis
- image retrieval
- multiple images
- image collections
- image features
- real world
- lighting conditions
- test images
- computer graphics
- image set
- object recognition
- edge detection
- image segmentation
- keypoints
- image regions
- image matching
- segmentation algorithm
- region of interest
- vision system
- language learning
- similarity measure
- computer vision
- d objects