Audio-visual saliency prediction for movie viewing in immersive environments: Dataset and benchmarks.
Zhao Chen
Kao Zhang
Hao Cai
Xiaoying Ding
Chenxi Jiang
Zhenzhong Chen
Published in:
J. Vis. Commun. Image Represent. (2024)
Keyphrases
audio visual
multi modal
immersive environments
visual information
visual data
video summarization
multi stream
person authentication
temporal context
multimedia
virtual world
image retrieval
feature vectors
low level
visual content
audio visual speech recognition