AVA-AVD: Audio-visual Speaker Diarization in the Wild.
Eric Zhongcong Xu
Zeyang Song
Satoshi Tsutsui
Chao Feng
Mang Ye
Mike Zheng Shou
Published in:
ACM Multimedia (2022)
Keyphrases
audio visual
speaker diarization
speaker verification
multi modal
visual information
emotion recognition
multi stream
audio features
speech recognition
visual data
multimedia
broadcast news
affective states
bayesian information criterion
pattern recognition
image classification
search engine