Login / Signup
Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention.
Xubo Liu
Qiushi Huang
Xinhao Mei
Haohe Liu
Qiuqiang Kong
Jianyuan Sun
Shengchen Li
Tom Ko
Yu Zhang
H. Lilian Tang
Mark D. Plumbley
Volkan Kiliç
Wenwu Wang
Published in:
CoRR (2022)
Keyphrases
</>
visual attention
eye movements
eye tracking
visual information
multimedia
saliency map
vision system
visual search
natural scenes
visual perception
audio visual
focus of attention
visual data
visual scene
higher level
salient regions
image sequences