DyViSE: Dynamic Vision-Guided Speaker Embedding for Audio-Visual Speaker Diarization.
Abudukelimu Wuerkaixi
Kunda Yan
You Zhang
Zhiyao Duan
Changshui Zhang
Published in:
MMSP (2022)
Keyphrases
audio visual
speaker diarization
vision guided
speaker verification
multi modal
mobile robot navigation
visual information
visual data
multimedia
speech recognition
natural scenes
emotion recognition
machine learning
mobile robot
speaker identification