Modeling audio directional statistics using a probabilistic spatial dictionary for speaker diarization in real meetings.
Mahmoud Fakhry
Nobutaka Ito
Shoko Araki
Tomohiro Nakatani
Published in:
IWAENC (2016)
Keyphrases
neural network
speaker diarization
audio stream
broadcast news
speech recognition
bayesian information criterion
spatial information
audio visual
speaker identification
bayesian networks
pattern recognition
video streams
video retrieval
multimedia
probabilistic model
model selection