Login / Signup
MSDWild: Multi-modal Speaker Diarization Dataset in the Wild.
Tao Liu
Shuai Fan
Xu Xiang
Hongbo Song
Shaoxiong Lin
Jiaqi Sun
Tianyuan Han
Siyuan Chen
Binwei Yao
Sen Liu
Yifei Wu
Yanmin Qian
Kai Yu
Published in:
INTERSPEECH (2022)
Keyphrases
</>
multi modal
speaker diarization
audio visual
high dimensional
multi modality
machine learning
speech recognition
image annotation
feature extraction
semantic concepts
cross modal
pattern recognition
uni modal