Vision Transformer Based Model for Describing a Set of Images as a Story.
Zainy M. MalakanGhulam Mubashar HassanAjmal MianPublished in: CoRR (2022)
Keyphrases
- image set
- probability distribution
- small number
- three dimensional
- ground truth
- input data
- high level
- hierarchical structure
- image data
- test images
- image analysis
- statistical model
- sample images
- viewing angle
- computational model
- edge detection
- object recognition
- image classification
- probabilistic model
- image matching
- expert systems
- video sequences
- computer vision
- neural network