Unveiling Hallucination in Text, Image, Video, and Audio Foundation Models: A Comprehensive Survey.
Pranab SahooPrabhash MehariaAkash GhoshSriparna SahaVinija JainAman ChadhaPublished in: CoRR (2024)
Keyphrases
- visual data
- image content
- video files
- input image
- image data
- multimedia
- image features
- image classification
- multiscale
- single image
- low level
- text graphics
- video images
- image retrieval
- multimedia processing
- image representation
- audio video
- digital video
- image segmentation
- visual information
- image database
- image collections
- edge detection
- video data
- web images
- low resolution images
- audio content
- video content analysis
- textual descriptions
- caption text
- keywords
- semantic labels
- segmentation method
- video search
- image regions
- video clips