Login / Signup
Target Active Speaker Detection with Audio-visual Cues.
Yidi Jiang
Ruijie Tao
Zexu Pan
Haizhou Li
Published in:
INTERSPEECH (2023)
Keyphrases
</>
visual cues
visual information
audio visual
low level
target detection
multimedia
visual features
visual data
object detection
mid level
multiple cues
audio stream
speech recognition
speaker identification
soccer video
speaker verification
multi modal
information retrieval
depth cues
multiple visual cues