mRedditSum: A Multimodal Abstractive Summarization Dataset of Reddit Threads with Images.
Keighley OverbayJaewoo AhnFatemeh Pesaran zadehJoonsuk ParkGunhee KimPublished in: EMNLP (2023)
Keyphrases
- image database
- image analysis
- image dataset
- input image
- image data
- image classification
- image features
- edge detection
- images with ground truth
- image registration
- multimodal image registration
- sample images
- multiple images
- multi modal
- keypoints
- ground truth
- image matching
- lighting conditions
- image collections
- spatial information
- video sequences
- pixel values
- object recognition
- rigid body
- visual concepts
- three dimensional
- image retrieval
- feature vectors
- computer vision