Audio-visual aligned saliency model for omnidirectional video with implicit neural representation learning.

Dandan Zhu, Xuan Shao, Kaiwei Zhang, Xiongkuo Min, Guangtao Zhai, Xiaokang Yang
Published in: Appl. Intell. (2023)
Keyphrases
  • audio visual
  • visual data
  • video summarization
  • multimedia
  • multi modal
  • multimedia data
  • spatio temporal
  • natural images
  • video data
  • visual information
  • key frames
  • video retrieval