Impact of visual assistance for automated audio captioning.
Wim Boes
Hugo Van hamme
Published in:
CoRR (2022)
Keyphrases
visual information
cross modal
visual data
visual features
visual perception
visual cues
low level
semi automated
video indexing and retrieval
emotion recognition
audio visual
fully automated
visual representation
signal processing
feature extraction
speaker identification
lifelog
multimedia