Self-Supervised Learning for Audio-Visual Speaker Diarization.
Yifan Ding
Yong Xu
Shi-Xiong Zhang
Yahuan Cong
Liqiang Wang
Published in:
ICASSP (2020)
Keyphrases
audio visual
data sets
multi modal
image data