Login / Signup
System fusion and speaker linking for longitudinal diarization of TV shows.
Marc Ferras
Srikanth R. Madikeri
Petr Motlícek
Hervé Bourlard
Published in:
ICASSP (2016)
Keyphrases
</>
speaker diarization
tv shows
closed captions
human interactions
speaker identification
video clips
data fusion
speech recognition
human interaction
speaker verification
broadcast news
user interface
sensor networks
audio visual
automatic speech recognition