Login / Signup
A multimodal approach to initialisation for top-down speaker diarization of television shows.
Simon Bozonnet
Félicien Vallet
Nicholas W. D. Evans
Slim Essid
Gaël Richard
Jean Carrive
Published in:
EUSIPCO (2010)
Keyphrases
</>
speaker diarization
multi modal
machine learning
high level
speech recognition
computer vision
audio visual