Login / Signup

Self-Supervised Learning for Audio-Visual Speaker Diarization.

Yifan DingYong XuShi-Xiong ZhangYahuan CongLiqiang Wang
Published in: ICASSP (2020)
Keyphrases
  • audio visual
  • data sets
  • multi modal
  • image data