Login / Signup
Target Active Speaker Detection with Audio-visual Cues.
Yidi Jiang
Ruijie Tao
Zexu Pan
Haizhou Li
Published in:
CoRR (2023)
Keyphrases
</>
visual cues
visual information
audio visual
low level
target detection
multiple visual cues
mid level
speaker identification
lecture videos
event detection
visual data
object detection
feature extraction
computer vision
stereo vision
visual features
context aware
soccer video
depth cues
audio stream