CASP-Net: Rethinking Video Saliency Prediction from an Audio-Visual Consistency Perceptual Perspective.
Junwen Xiong, Ganglai Wang, Peng Zhang, Wei Huang, Yufei Zha, Guangtao Zhai
Published in: CoRR (2023)
Keyphrases
- multimedia
- audio video
- digital video
- prediction accuracy
- video content analysis
- multimedia processing
- scene change detection
- visual data
- visual saliency
- audio files
- multimedia information
- video files
- video data
- video content
- human visual perception
- cross modal
- video sequences
- video streams
- audio stream
- video material
- video analysis
- audio features
- video database
- digital audio
- broadcast news
- video signals
- audio content
- lecture videos
- audio signals
- human visual system
- story segmentation
- media streams
- audio visual content
- low level
- visual information
- online video
- audio signal
- perceptual quality
- human perception
- space time
- soccer video
- multi modal
- audio visual
- video segmentation