Multimodal speaker diarization for meetings using volume-evaluated SRP-PHAT and video analysis.
Pablo Cabañas Molero
M. J. Lucena-Lopez
José Manuel Fuertes
Pedro Vera-Candeas
Nicolás Ruiz-Reyes
Published in:
Multim. Tools Appl. (2018)
Keyphrases
speaker diarization
video analysis
video data
video processing
speech recognition
event detection
broadcast news
semantic concepts
video database
Bayesian information criterion
machine learning
multimodal
sports video
soccer video
speaker identification
low level
hidden Markov models
pattern recognition